In this paper we give a complete analysis of the phase transitions in the
mean-field Blume-Emery-Griffiths lattice-spin model with respect to the
canonical ensemble, showing both a second-order, continuous phase transition
and a first-order, discontinuous phase transition for appropriate values of
the thermodynamic parameters that define the model. These phase transitions
are analyzed both in terms of the empirical measure and the spin per site by
studying bifurcation phenomena of the corresponding sets of canonical
equilibrium macrostates, which are defined via large deviation principles.
Analogous phase transitions with respect to the microcanonical ensemble are
also studied via a combination of rigorous analysis and numerical
calculations. Finally, probabilistic limit theorems for appropriately scaled
values of the total spin are proved with respect to the canonical ensemble.
These limit theorems include both central-limit-type theorems, when the
thermodynamic parameters are not equal to critical values, and
noncentral-limit-type theorems, when these parameters equal critical values.

1. Introduction. The Blume-Emery-Griffiths (BEG) model [4] is an important
lattice-spin model in statistical mechanics. It is one of the few and
certainly one of the simplest models known to exhibit, in its mean-field
version, both a continuous, second-order phase transition and a
discontinuous, first-order phase transition. Because of this property, the
model has been studied extensively as a model of many diverse systems,
including $^3$He-$^4$He mixtures (the system for which Blume, Emery and
Griffiths first devised their model [4]) as well as solid-liquid-gas systems
[18, 24, 25], microemulsions [23], semiconductor alloys [19] and electronic
conduction models [17]. Phase diagrams for a class of models including the
Blume-Emery-Griffiths model are discussed in [1], which lists additional work
on this and related models. On a more theoretical level, the BEG model has
also played an important role in the development of the renormalization-group
theory of phase transitions of the Potts model; see [16, 20] for details and
references.

As a model with a simple description but a relatively complicated phase 
transition structure, the BEG model continues to be of interest in mod- 
ern statistical mechanical studies. In this paper we focus on the mean-field 
version of the BEG model or, equivalently, the BEG model on the complete 
graph on n vertices. Our motivation for revisiting this model was initiated by 
a recent observation in [2, 3] that the BEG model on the complete graph has 
nonequivalent microcanonical and canonical ensembles, in the sense that it 
exhibits microcanonical equilibrium properties having no equivalent within 
the canonical ensemble. This observation is supported in [15] by numerical 
calculations both at the thermodynamic level, as in [2, 3], and at the level 
of equilibrium macrostates. In response to these earlier works, in this paper 
we address the phase transition behavior of the model by giving separate 
analyses of the structure of the sets of equilibrium macrostates for each of 
the two ensembles. Not only are our results consistent with the findings in 
[2, 3, 15], but also we rigorously prove for the first time a number of results 
that significantly generalize those found in these papers, where they were de- 
rived nonrigorously. For the canonical ensemble, full proofs of the structure 
of the set of equilibrium macrostates are provided. For the microcanonical 
ensemble, full proofs could not be attained. However, using numerical meth- 
ods and following an analogous technique used in the canonical case, we also 
analyze the structure of the set of microcanonical equilibrium macrostates. 

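The BEG model that we consider is a spin-1 model defined on the complete
graph on $n$ vertices $1, 2, \ldots, n$. The spin at site
$j \in \{1, 2, \ldots, n\}$ is denoted by $\omega_j$, a quantity taking values
in $\Lambda = \{-1, 0, 1\}$. The Hamiltonian for the BEG model is defined by

$H_{n,K}(\omega) = \sum_{j=1}^n \omega_j^2 - \frac{K}{n} \left( \sum_{j=1}^n \omega_j \right)^2,$

where $K > 0$ is a given parameter representing the interaction strength and
$\omega = (\omega_1, \ldots, \omega_n) \in \Lambda^n$. The energy per particle
is defined by

(1.1)  $h_{n,K}(\omega) = \frac{1}{n} H_{n,K}(\omega) = \frac{\sum_{j=1}^n \omega_j^2}{n} - K \left( \frac{\sum_{j=1}^n \omega_j}{n} \right)^2.$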
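As a concrete illustration of (1.1), the following minimal sketch evaluates the Hamiltonian and the energy per particle for a few small configurations. The function names and the sample values of $K$ and $\omega$ are illustrative assumptions, not part of the paper.

```python
def hamiltonian(omega, K):
    """H_{n,K}(omega) = sum_j omega_j^2 - (K/n) * (sum_j omega_j)^2."""
    n = len(omega)
    return sum(w * w for w in omega) - (K / n) * sum(omega) ** 2


def energy_per_particle(omega, K):
    """h_{n,K}(omega) = H_{n,K}(omega) / n, as in (1.1)."""
    return hamiltonian(omega, K) / len(omega)


# Illustrative values only: K and the configurations are arbitrary choices.
K = 2.0
all_up = (1, 1, 1, 1)      # fully aligned spins: h = 1 - K
all_zero = (0, 0, 0, 0)    # all spins zero: h = 0
mixed = (1, -1, 1, -1)     # zero total spin, all sites nonzero: h = 1

for name, omega in (("all_up", all_up), ("all_zero", all_zero), ("mixed", mixed)):
    print(name, energy_per_particle(omega, K))
```

The three configurations realize the values $1 - K$, $0$ and $1$ of the energy per particle.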

In order to analyze the phase transition behavior of the model, we first 
introduce the sets of equilibrium macrostates for the canonical ensemble 
and the microcanonical ensemble. As we will see, the canonical equilibrium
macrostates solve a two-dimensional, unconstrained minimization problem, 
while the microcanonical equilibrium macrostates solve a dual, one-dimensional, 
constrained minimization problem. The definitions of these sets follow from 
large deviation principles derived for general models in [10]. In the partic- 
ular case of the BEG model, they are consequences of the fact that the 
BEG-Hamiltonian can be written as a function of the empirical measures of 
the spin random variables and that, according to Sanov's theorem, the large 
deviation behavior of these empirical measures is governed by the relative 
entropy. 

We use two innovations to analyze the structure of the set of canon- 
ical equilibrium macrostates. The first is to reduce to a one-dimensional 
problem the two-dimensional minimization problem that characterizes these 
macrostates. This is carried out by absorbing the noninteracting component 
of the energy per particle function into the prior measure, which is a prod- 
uct measure on configuration space. This manipulation allows us to express 
the canonical ensemble in terms of the empirical means, or spin per site 
$S_n/n = \sum_{j=1}^n \omega_j/n$, of the spin random variables. Doing so
reduces the analysis of the BEG model to the analysis of a Curie-Weiss-type
model [9] with single-site measures depending on $\beta$.

The analysis of the set of canonical equilibrium macrostates is further 
simplified by a second innovation. Because the thermodynamic parameter 
that defines the canonical ensemble is the inverse temperature $\beta$, a
phase transition with respect to this ensemble is defined by fixing the
Hamiltonian-parameter $K$ and varying $\beta$. Our analysis of the set of
canonical equilibrium macrostates is based on a much more efficient approach
that fixes $\beta$ and varies $K$. Proceeding in this way allows us to solve
rigorously and in complete detail the reduced one-dimensional problem
characterizing the equilibrium macrostates. We then extrapolate these results
obtained by fixing $\beta$ and varying $K$ to physically relevant results that
hold for fixed $K$ and varying $\beta$. These include a second-order,
continuous phase transition and a first-order, discontinuous phase transition
for different ranges of $K$.

For the microcanonical ensemble, we use a technique employed in [2] that 
absorbs the constraint into the minimizing function. This step allows us 
to reduce the constrained minimization problem defining the microcanoni- 
cal equilibrium macrostates to another minimization problem that is more 
easily solved. Rigorous analysis of the reduced problem being limited, we 
rely mostly on numerical computations to complete our analysis of the set 
of equilibrium macrostates. Because the thermodynamic parameter defining 
the microcanonical ensemble is the energy per particle u, a phase transi- 
tion with respect to this ensemble is defined by fixing K and varying u. 
By analogy with the canonical case, our numerical analysis of the set of 
microcanonical equilibrium macrostates is based on a much more efficient 
approach that fixes $u$ and varies $K$. The analysis with respect to $K$
rather than $u$ allows us to solve in some detail the reduced problem
characterizing the equilibrium macrostates. We then extrapolate these results
obtained by fixing $u$ and varying $K$ to physically relevant results that
hold for fixed $K$ and varying $u$. As in the case of the canonical ensemble,
these include a second-order, continuous phase transition and a first-order,
discontinuous phase transition for different ranges of $K$.

The contributions of this paper include a rigorous global analysis of the 
first-order phase transition in the canonical ensemble. Blume, Emery and
Griffiths did a local analysis of the spin per site to show that their model 
exhibits a second-order phase transition for a range of values of K and that, 
at a certain value of K, a tricritical point appears [4]; a similar study of a 
related model is carried out in [5, 6]. This tricritical point has the property 
that, for all smaller values of K, we are dealing with a first-order phase 
transition. Mathematically, the tricritical point marks the beginning of the 
failure of the local analysis; beyond this point, one has to resort to a global 
analysis of the spin per site. While the first-order phase transition has been 
studied numerically by several authors, the present paper gives the first 
rigorous global analysis. 

Another contribution is that we analyze the phase transition for the 
canonical ensemble both in terms of the spin per site and the empirical 
measure. While all previous studies of the BEG model, except for [15], fo- 
cused only on the spin per site, the analysis in terms of the empirical measure 
is the natural context for understanding equivalence and nonequivalence of 
ensembles [15]. 

A main consequence of our analysis is that the tricritical point — the crit- 
ical value of the Hamiltonian parameter K at which the model changes its 
phase transition behavior from second-order to first-order — differs in the two 
ensembles. Specifically, the tricritical point is smaller in the microcanonical
ensemble than in the canonical ensemble. Therefore, there exists a range of 
values of K such that the BEG model with respect to the canonical ensem- 
ble exhibits a first-order phase transition, while, with respect to the micro- 
canonical ensemble, the model exhibits a second-order phase transition. As 
we discuss in Section 5, these results are consistent with the observation, 
seen numerically in [15], that there exists a subset of the microcanonical 
equilibrium macrostates that are not realized canonically. This observation 
implies that the two ensembles are nonequivalent at the level of equilibrium 
macrostates. 

A final contribution of this paper is to present probabilistic limit theorems
for appropriately scaled partial sums $S_n = \sum_{j=1}^n \omega_j$ with
respect to the canonical ensemble. These limits follow from our work in
Section 3 and known limit theorems for the Curie-Weiss model derived in
[12, 14]. They include conditioned limit theorems when there are multiple
equilibrium macrostates representing coexisting phases. In most cases the
limits involve the central-limit-type scaling $n^{1/2}$ and convergence in
distribution of $(S_n - nz)/n^{1/2}$ to a normal random variable, where $z$ is
an equilibrium macrostate. They also include the following two nonclassical
cases, which hold for appropriate critical values of the parameters defining
the canonical ensemble:

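$S_n/n^{3/4} \xrightarrow{\mathcal{D}} X$ where $P\{X \in dx\} = \mathrm{const} \cdot \exp[-\mathrm{const} \cdot x^4] \, dx$

and

$S_n/n^{5/6} \xrightarrow{\mathcal{D}} X$ where $P\{X \in dx\} = \mathrm{const} \cdot \exp[-\mathrm{const} \cdot x^6] \, dx.$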

As in the case of more complicated models, such as the Ising model, these 
nonclassical theorems signal the onset of a phase transition in the BEG 
model ([9], Section V.8). They are analogues of a result for the much simpler 
Curie-Weiss model ([9], Theorem V.9.5).

The outline of the paper is as follows. In Section 2, following the general 
procedure described in [10], we define the canonical ensemble, the micro- 
canonical ensemble and the corresponding sets of equilibrium macrostates. 
In Section 3 the structure of the set of canonical equilibrium macrostates 
is studied. The initial analysis is carried out in Sections 3.2 and 3.3 at the 
level of the spin per site Sn/n after the BEG model is written as a Curie- 
Weiss-type model in Section 3.1. In Sections 3.4 and 3.5 the information at 
the level of the spin per site is lifted to the level of the empirical measures of 
the spin random variables using the contraction principle, a main tool in the 
theory of large deviations. In Section 4 we present new theoretical insights 
into, and numerical results concerning, the structure of the set of micro- 
canonical equilibrium macrostates. In Section 5 we discuss the implications 
of the results in the two previous sections concerning the nature of the phase 
transitions in the BEG model, which in turn is related to the phenomenon of 
ensemble nonequivalence at the level of equilibrium macrostates. Section 6 
is devoted to probabilistic limit theorems for appropriately scaled sums $S_n$.

2. Sets of equilibrium macrostates for the two ensembles. The canonical and
microcanonical ensembles are defined in terms of probability measures on a
sequence of probability spaces $(\Lambda^n, \mathcal{F}_n)$. The configuration
spaces $\Lambda^n$ consist of microstates
$\omega = (\omega_1, \ldots, \omega_n)$ with each
$\omega_j \in \Lambda = \{-1, 0, 1\}$, and $\mathcal{F}_n$ is the
$\sigma$-field consisting of all subsets of $\Lambda^n$. We also introduce the
$n$-fold product measure $P_n$ on $\mathcal{F}_n$ with identical
one-dimensional marginals
$\rho = \frac{1}{3}(\delta_{-1} + \delta_0 + \delta_1)$.

In terms of the energy per particle $h_{n,K}$ defined in (1.1), for each
$n \in \mathbb{N}$, $\beta > 0$ and $K > 0$, the partition function is defined
by

$Z_n(\beta, K) = \int_{\Lambda^n} \exp[-n \beta h_{n,K}] \, dP_n.$

For sets $B \in \mathcal{F}_n$, the canonical ensemble for the BEG model is the
probability measure

(2.1)  $P_{n,\beta,K}(B) = \frac{1}{Z_n(\beta, K)} \cdot \int_B \exp[-n \beta h_{n,K}] \, dP_n.$

For $u \in \mathbb{R}$, $r > 0$, $K > 0$ and sets $B \in \mathcal{F}_n$, the
microcanonical ensemble is the conditional probability measure

(2.2)  $P_n^{u,r,K}(B) = P_n\{B \mid h_{n,K} \in [u - r, u + r]\}$
       $= \frac{P_n\{B \cap \{h_{n,K} \in [u - r, u + r]\}\}}{P_n\{h_{n,K} \in [u - r, u + r]\}}.$

As we point out after (2.4), for appropriate values of $u$ and all sufficiently
large $n$, the denominator is positive and, thus, $P_n^{u,r,K}$ is well
defined.

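The key to our analysis of the BEG model is to express both the canonical and
the microcanonical ensembles in terms of the empirical measure $L_n$ defined
for $\omega \in \Lambda^n$ by

$L_n = L_n(\omega, \cdot) = \frac{1}{n} \sum_{j=1}^n \delta_{\omega_j}(\cdot).$

$L_n$ takes values in $\mathcal{P}(\Lambda)$, the set of probability measures
on $\Lambda = \{-1, 0, 1\}$. For $i \in \Lambda$, $L_n(\omega, \{i\})$ denotes
the relative frequency of spins $\omega_j$ taking the value $i$. We rewrite
$h_{n,K}$ as

$h_{n,K}(\omega) = \frac{\sum_{j=1}^n \omega_j^2}{n} - K \left( \frac{\sum_{j=1}^n \omega_j}{n} \right)^2$

and, for $\mu \in \mathcal{P}(\Lambda)$, we define

(2.3)  $f_K(\mu) = \int_\Lambda y^2 \, \mu(dy) - K \left( \int_\Lambda y \, \mu(dy) \right)^2 = (\mu_1 + \mu_{-1}) - K (\mu_1 - \mu_{-1})^2.$

The range of this function is the closed interval $[\min(1 - K, 0), 1]$. In
terms of $f_K$, we express $h_{n,K}$ in the form

$h_{n,K}(\omega) = f_K(L_n(\omega)).$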
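The identity $h_{n,K}(\omega) = f_K(L_n(\omega))$ can be checked directly on a small configuration. The sketch below is only illustrative: the helper names, the configuration and the value of $K$ are assumptions introduced here.

```python
from collections import Counter

SPINS = (-1, 0, 1)


def empirical_measure(omega):
    """L_n(omega, {i}): relative frequency of each spin value i."""
    counts = Counter(omega)
    n = len(omega)
    return {i: counts[i] / n for i in SPINS}


def f_K(mu, K):
    """f_K(mu) = int y^2 mu(dy) - K * (int y mu(dy))^2, as in (2.3)."""
    second_moment = sum(i * i * mu[i] for i in SPINS)
    mean = sum(i * mu[i] for i in SPINS)
    return second_moment - K * mean ** 2


def h_direct(omega, K):
    """Energy per particle computed straight from the spins, as in (1.1)."""
    n = len(omega)
    return sum(w * w for w in omega) / n - K * (sum(omega) / n) ** 2


# Arbitrary illustrative configuration and interaction strength.
omega = (1, 1, 0, -1, 0, 1, 1, 0)
K = 1.5
L_n = empirical_measure(omega)
print(L_n, f_K(L_n, K), h_direct(omega, K))
```

Both evaluations depend on the configuration only through the three relative frequencies, which is what makes the empirical measure a convenient macroscopic variable.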

We appeal to the theory of large deviations to define the sets of canonical
equilibrium macrostates and microcanonical equilibrium macrostates. Since any
$\mu \in \mathcal{P}(\Lambda)$ has the form $\sum_{i=-1}^{1} \mu_i \delta_i$,
where $\mu_i \geq 0$ and $\sum_{i=-1}^{1} \mu_i = 1$, $\mathcal{P}(\Lambda)$
can be identified with the set of probability vectors in $\mathbb{R}^3$. We
topologize $\mathcal{P}(\Lambda)$ with the relative topology that this set
inherits as a subset of $\mathbb{R}^3$. The relative entropy of
$\mu \in \mathcal{P}(\Lambda)$ with respect to $\rho$ is defined by

$R(\mu|\rho) = \sum_{i=-1}^{1} \mu_i \log(3 \mu_i).$
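For the uniform prior $\rho$ the relative entropy has the simple closed form above, which the following sketch evaluates. The function name and the tuple ordering $(\mu_{-1}, \mu_0, \mu_1)$ are conventions assumed here for illustration; terms with $\mu_i = 0$ are taken to contribute zero.

```python
import math


def relative_entropy(mu):
    """R(mu|rho) = sum_i mu_i * log(3 * mu_i) for rho uniform on {-1, 0, 1}.

    mu is a tuple (mu_{-1}, mu_0, mu_1); terms with mu_i = 0 contribute 0.
    """
    return sum(m * math.log(3.0 * m) for m in mu if m > 0.0)


uniform = (1 / 3, 1 / 3, 1 / 3)   # equals rho, so the relative entropy vanishes
point_mass = (0.0, 1.0, 0.0)      # all spins zero, relative entropy log 3

print(relative_entropy(uniform), relative_entropy(point_mass))
```

The relative entropy is zero only at $\mu = \rho$ and attains its maximum $\log 3$ at the three point masses.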

Sanov's theorem states that, with respect to the product measures $P_n$, the
empirical measures $L_n$ satisfy the large deviation principle (LDP) on
$\mathcal{P}(\Lambda)$ with rate function $R(\cdot|\rho)$ ([9],
Theorem VIII.2.1). That is, for any closed subset $F$ of
$\mathcal{P}(\Lambda)$, we have the large deviation upper bound

$\limsup_{n \to \infty} \frac{1}{n} \log P_n\{L_n \in F\} \leq - \inf_{\mu \in F} R(\mu|\rho),$

and, for any open subset $G$ of $\mathcal{P}(\Lambda)$, we have the large
deviation lower bound

$\liminf_{n \to \infty} \frac{1}{n} \log P_n\{L_n \in G\} \geq - \inf_{\mu \in G} R(\mu|\rho).$

From the LDP for the $P_n$-distributions of $L_n$, we can derive the LDPs of
$L_n$ with respect to the two ensembles $P_{n,\beta,K}$ and $P_n^{u,r,K}$. In
order to state these LDPs, we introduce two basic thermodynamic functions, one
associated with each ensemble. For $\beta > 0$ and $K > 0$, the basic
thermodynamic function for the canonical ensemble is the canonical free energy

$\varphi_K(\beta) = - \lim_{n \to \infty} \frac{1}{n} \log Z_n(\beta, K).$

It follows from Theorem 2.4(a) in [10] that this limit exists for all
$\beta > 0$ and $K > 0$ and is given by

$\varphi_K(\beta) = \inf_{\mu \in \mathcal{P}(\Lambda)} \{R(\mu|\rho) + \beta f_K(\mu)\}.$

For the microcanonical ensemble, the basic thermodynamic function is the
microcanonical entropy

(2.4)  $s_K(u) = - \inf\{R(\mu|\rho) : \mu \in \mathcal{P}(\Lambda), \, f_K(\mu) = u\}.$

Since $R(\mu|\rho) \geq 0$ for all $\mu$, $s_K(u) \in [-\infty, 0]$ for all
$u$. We define $\mathrm{dom}\, s_K$ to be the set of $u \in \mathbb{R}$ for
which $s_K(u) > -\infty$. Clearly, $\mathrm{dom}\, s_K$ coincides with the
range of $f_K$ on $\mathcal{P}(\Lambda)$, which equals the closed interval
$[\min(1 - K, 0), 1]$. For $u \in \mathrm{dom}\, s_K$ and all sufficiently
large $n$, the denominator in the second line of (2.2) is positive and, thus,
the microcanonical ensemble $P_n^{u,r,K}$ is well defined ([10],
Proposition 3.1).

The LDPs for $L_n$ with respect to the two ensembles are given in the next
theorem. They are consequences of Theorems 2.4 and 3.2 in [10].

Theorem 2.1. (a) With respect to the canonical ensemble $P_{n,\beta,K}$, the
empirical measures $L_n$ satisfy the LDP on $\mathcal{P}(\Lambda)$ with rate
function

(2.5)  $I_{\beta,K}(\mu) = R(\mu|\rho) + \beta f_K(\mu) - \varphi_K(\beta).$

(b) With respect to the microcanonical ensemble $P_n^{u,r,K}$, the empirical
measures $L_n$ satisfy the LDP on $\mathcal{P}(\Lambda)$, in the double limit
$n \to \infty$ and $r \to 0$, with rate function

(2.6)  $I^{u,K}(\mu) = \begin{cases} R(\mu|\rho) + s_K(u), & \text{if } f_K(\mu) = u, \\ \infty, & \text{otherwise.} \end{cases}$



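For $\mu \in \mathcal{P}(\Lambda)$ and $\varepsilon > 0$, we denote by
$B(\mu, \varepsilon)$ the closed ball in $\mathcal{P}(\Lambda)$ with center
$\mu$ and radius $\varepsilon$. If $I_{\beta,K}(\mu) > 0$, then for all
sufficiently small $\varepsilon > 0$,
$\inf_{\nu \in B(\mu, \varepsilon)} I_{\beta,K}(\nu) > 0$. Hence, by the large
deviation upper bound for $L_n$ with respect to the canonical ensemble, for all
$\mu \in \mathcal{P}(\Lambda)$ satisfying $I_{\beta,K}(\mu) > 0$, all
sufficiently small $\varepsilon > 0$ and all sufficiently large $n$,

$P_{n,\beta,K}\{L_n \in B(\mu, \varepsilon)\} \leq \exp\left[ -\frac{n}{2} \inf_{\nu \in B(\mu, \varepsilon)} I_{\beta,K}(\nu) \right],$

which converges to 0 exponentially fast. Consequently, the most probable
macrostates $\nu$ solve $I_{\beta,K}(\nu) = 0$. It is therefore natural to
define the set of canonical equilibrium macrostates to be

(2.7)  $\mathcal{E}_{\beta,K} = \{\nu \in \mathcal{P}(\Lambda) : I_{\beta,K}(\nu) = 0\}$
       $= \{\nu \in \mathcal{P}(\Lambda) : \nu \text{ minimizes } R(\nu|\rho) + \beta f_K(\nu)\}.$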
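The minimization in (2.7) can be explored numerically. The sketch below is a crude grid search over the simplex of probability vectors $(\nu_{-1}, \nu_0, \nu_1)$; it is not the method of the paper, and the function names, the mesh size and the parameter values are assumptions chosen for illustration.

```python
import math


def objective(nu, beta, K):
    """R(nu|rho) + beta * f_K(nu) for nu = (nu_{-1}, nu_0, nu_1)."""
    entropy = sum(p * math.log(3.0 * p) for p in nu if p > 0.0)
    f = (nu[0] + nu[2]) - K * (nu[2] - nu[0]) ** 2
    return entropy + beta * f


def grid_minimizer(beta, K, steps=200):
    """Best grid point on the simplex with mesh 1/steps (a crude search)."""
    best, best_val = None, math.inf
    for a in range(steps + 1):
        for b in range(steps + 1 - a):
            nu = (a / steps, (steps - a - b) / steps, b / steps)
            val = objective(nu, beta, K)
            if val < best_val:
                best, best_val = nu, val
    return best, best_val


# Illustrative parameters: beta = 1 with a weak and a strong interaction.
nu_small_K, _ = grid_minimizer(beta=1.0, K=0.5)
nu_large_K, _ = grid_minimizer(beta=1.0, K=3.0)
print(nu_small_K, nu_large_K)
```

For the weak interaction the search returns a symmetric measure close to $(e^{-\beta}, 1, e^{-\beta})/(1 + 2e^{-\beta})$, while for the strong interaction it returns one of two mirror-image measures with nonzero mean; which of the two is reported depends on floating-point ties.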

Similarly, because of the large deviation upper bound for $L_n$ with respect to
the microcanonical ensemble, it is natural to define the set of microcanonical
equilibrium macrostates to be

(2.8)  $\mathcal{E}^{u,K} = \{\nu \in \mathcal{P}(\Lambda) : I^{u,K}(\nu) = 0\}$
       $= \{\nu \in \mathcal{P}(\Lambda) : \nu \text{ minimizes } R(\nu|\rho) \text{ subject to } f_K(\nu) = u\}.$

Each element $\nu$ in $\mathcal{E}_{\beta,K}$ and $\mathcal{E}^{u,K}$ has the
form $\nu = \nu_{-1} \delta_{-1} + \nu_0 \delta_0 + \nu_1 \delta_1$ and
describes an equilibrium configuration of the model in the corresponding
ensemble. For $i = -1, 0, 1$, $\nu_i$ gives the asymptotic relative frequency
of spins taking the value $i$.
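The constrained problem in (2.8) can be made one-dimensional by solving the constraint for $q = \nu_1 + \nu_{-1}$ in terms of the mean $m = \nu_1 - \nu_{-1}$, since $f_K(\nu) = q - K m^2$. The sketch below uses this parametrization with a plain grid search; it is one possible way to absorb the constraint and is not claimed to be the reduction used later in the paper. All names and parameter values are illustrative assumptions.

```python
import math


def entropy_on_constraint(m, u, K):
    """R(nu|rho) for the nu with mean m and f_K(nu) = u, or inf if infeasible."""
    q = u + K * m * m            # nu_1 + nu_{-1}, forced by the constraint
    if q < abs(m) or q > 1.0:
        return math.inf
    nu = ((q - m) / 2.0, 1.0 - q, (q + m) / 2.0)
    return sum(p * math.log(3.0 * p) for p in nu if p > 0.0)


def microcanonical_minimizer(u, K, steps=2000):
    """Grid minimizer over m in [0, 1]; m >= 0 suffices by the spin-flip symmetry."""
    grid = [i / steps for i in range(steps + 1)]
    return min(grid, key=lambda m: entropy_on_constraint(m, u, K))


# u = 2/3 is the energy of the uniform measure rho, so m = 0 is optimal there.
m_uniform = microcanonical_minimizer(2.0 / 3.0, 1.0)
# A low energy with strong interaction forces a nonzero mean.
m_low = microcanonical_minimizer(-1.0, 3.0)
print(m_uniform, m_low)
```

In the second case the feasibility conditions $|m| \leq q \leq 1$ alone confine $m$ to roughly $[0.768, 0.816]$, so any minimizer must be magnetized.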

In the next section we begin our study of the sets of equilibrium macrostates
for the BEG model by analyzing $\mathcal{E}_{\beta,K}$.
3. Structure of the set of canonical equilibrium macrostates. In this section
we give a complete description of the set $\mathcal{E}_{\beta,K}$ of canonical
equilibrium macrostates for all values of $\beta$ and $K$. In contrast to all
other studies of the model, which fix $K$ and vary $\beta$, we analyze the
structure of $\mathcal{E}_{\beta,K}$ by fixing $\beta$ and varying $K$. As
stated in Theorems 3.1 and 3.2, there exists a critical value of $\beta$,
denoted by $\beta_c$ and equal to $\log 4$, such that $\mathcal{E}_{\beta,K}$
has two different forms for $0 < \beta \leq \beta_c$ and for
$\beta > \beta_c$. Specifically, for fixed $0 < \beta \leq \beta_c$,
$\mathcal{E}_{\beta,K}$ exhibits a continuous bifurcation as $K$ passes through
a critical value $K_c^{(2)}(\beta)$, while for fixed $\beta > \beta_c$,
$\mathcal{E}_{\beta,K}$ exhibits a discontinuous bifurcation as $K$ passes
through a critical value $K_c^{(1)}(\beta)$. In Section 5 we show how to
extrapolate this information to information concerning the phase transition
behavior of the canonical ensemble for varying $\beta$: a continuous,
second-order phase transition for all fixed, sufficiently large values of $K$
and a discontinuous, first-order phase transition for all fixed, sufficiently
small values of $K$.

In terms of the uniform measure
$\rho = \frac{1}{3}(\delta_{-1} + \delta_0 + \delta_1)$, we define

(3.1)  $\rho_\beta(d\omega_j) = \frac{1}{Z(\beta)} \cdot \exp(-\beta \omega_j^2) \, \rho(d\omega_j),$

where $Z(\beta) = \int_\Lambda \exp(-\beta \omega_j^2) \, \rho(d\omega_j)$. The
next two theorems give the form of $\mathcal{E}_{\beta,K}$ for
$0 < \beta \leq \beta_c$ and for $\beta > \beta_c$. Theorem 3.1 will be proved
in Section 3.5 as a consequence of Theorem 3.6, which is proved in
Section 3.2.



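Theorem 3.1. Define $\beta_c = \log 4$ and let $\rho_\beta$ be the measure
defined in (3.1). For $0 < \beta \leq \beta_c$, the following conclusions hold:

(a) There exists a critical value $K_c^{(2)}(\beta) > 0$ defined in (3.19) and
having the following properties:

(i) For $0 < K \leq K_c^{(2)}(\beta)$, $\mathcal{E}_{\beta,K} = \{\rho_\beta\}$.

(ii) For $K > K_c^{(2)}(\beta)$, there exist probability measures
$\nu^+(\beta, K)$ and $\nu^-(\beta, K)$ in $\mathcal{P}(\Lambda)$ such that
$\nu^+(\beta, K) \neq \nu^-(\beta, K) \neq \rho_\beta$ and
$\mathcal{E}_{\beta,K} = \{\nu^+(\beta, K), \nu^-(\beta, K)\}$.

(b) If we write
$\nu^+(\beta, K) = \nu_{-1}^+ \delta_{-1} + \nu_0^+ \delta_0 + \nu_1^+ \delta_1$,
then
$\nu^-(\beta, K) = \nu_1^+ \delta_{-1} + \nu_0^+ \delta_0 + \nu_{-1}^+ \delta_1$.

(c) For each choice of sign, $\nu^\pm(\beta, K)$ is a continuous function for
$K > K_c^{(2)}(\beta)$, and as $K \to (K_c^{(2)}(\beta))^+$,
$\nu^\pm(\beta, K) \to \rho_\beta$. Therefore, $\mathcal{E}_{\beta,K}$ exhibits
a continuous bifurcation at $K_c^{(2)}(\beta)$.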

The continuous bifurcation described in part (c) of the theorem is an analogue
of a second-order phase transition and explains the superscript 2 on the
critical value $K_c^{(2)}(\beta)$. The next theorem shows that, for
$\beta > \beta_c$, the set $\mathcal{E}_{\beta,K}$ exhibits a discontinuous
bifurcation at a value $K_c^{(1)}(\beta)$. This analogue of a first-order phase
transition explains the superscript 1 on the corresponding critical value
$K_c^{(1)}(\beta)$. Theorem 3.2 will be proved in Section 3.5 as a consequence
of Theorem 3.8, which is proved in Section 3.3. As we will see in the proof of
the latter theorem, $K_c^{(1)}(\beta)$ is the unique zero of the function
$A(K)$ defined in (3.31) for $K > K_1(\beta)$; $K_1(\beta)$ is specified in
Lemma 3.9.

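Theorem 3.2. Define $\beta_c = \log 4$ and let $\rho_\beta$ be the measure
defined in (3.1). For $\beta > \beta_c$, the following conclusions hold:

(a) There exists a critical value $K_c^{(1)}(\beta) > 0$ having the following
properties:

(i) For $0 < K < K_c^{(1)}(\beta)$, $\mathcal{E}_{\beta,K} = \{\rho_\beta\}$.

(ii) For $K = K_c^{(1)}(\beta)$, there exist probability measures
$\nu^+(\beta, K_c^{(1)}(\beta))$ and $\nu^-(\beta, K_c^{(1)}(\beta))$ such that
$\nu^+ \neq \nu^- \neq \rho_\beta$ and
$\mathcal{E}_{\beta,K} = \{\rho_\beta, \nu^+(\beta, K_c^{(1)}(\beta)), \nu^-(\beta, K_c^{(1)}(\beta))\}$.

(iii) For $K > K_c^{(1)}(\beta)$, there exist probability measures
$\nu^+(\beta, K)$ and $\nu^-(\beta, K)$ such that
$\nu^+(\beta, K) \neq \nu^-(\beta, K) \neq \rho_\beta$ and
$\mathcal{E}_{\beta,K} = \{\nu^+(\beta, K), \nu^-(\beta, K)\}$.

(b) If we write
$\nu^+(\beta, K) = \nu_{-1}^+ \delta_{-1} + \nu_0^+ \delta_0 + \nu_1^+ \delta_1$,
then
$\nu^-(\beta, K) = \nu_1^+ \delta_{-1} + \nu_0^+ \delta_0 + \nu_{-1}^+ \delta_1$.

(c) For each choice of sign, $\nu^\pm(\beta, K)$ is a continuous function for
$K > K_c^{(1)}(\beta)$, and as $K \to (K_c^{(1)}(\beta))^+$,
$\nu^\pm(\beta, K) \to \nu^\pm(\beta, K_c^{(1)}(\beta)) \neq \rho_\beta$.
Therefore, $\mathcal{E}_{\beta,K}$ exhibits a discontinuous bifurcation at
$K_c^{(1)}(\beta)$.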

We prove Theorems 3.1 and 3.2 in several steps. In the first step, carried out
in Section 3.1, we absorb the noninteracting component of the energy per
particle into the product measure of the canonical ensemble. This reduces the
model to a Curie-Weiss-type model, which can be analyzed in terms of the
empirical means $S_n/n = \sum_{j=1}^n \omega_j/n$. The structure of the set of
canonical equilibrium macrostates for this Curie-Weiss-type model is analyzed
in Section 3.2 for $0 < \beta \leq \beta_c$ and in Section 3.3 for
$\beta > \beta_c$. In Section 3.4 we lift our results from the level of the
empirical means up to the level of the empirical measures using the
contraction principle, a main tool in the theory of large deviations. Finally,
in Section 3.5 we derive Theorems 3.1 and 3.2 from the results derived in
Section 3.2 for $0 < \beta \leq \beta_c$ and in Section 3.3 for
$\beta > \beta_c$.

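3.1. Reduction to the Curie-Weiss model. The first step in the proofs of
Theorems 3.1 and 3.2 is to rewrite the canonical ensemble $P_{n,\beta,K}$ in
the form of a Curie-Weiss-type model. We do this by absorbing the
noninteracting component of the energy per particle $h_{n,K}$ into the product
measure of $P_{n,\beta,K}$. Defining $S_n(\omega) = \sum_{j=1}^n \omega_j$, we
write

$P_{n,\beta,K}(d\omega) = \frac{1}{Z_n(\beta, K)} \cdot \exp[-n \beta h_{n,K}(\omega)] \, P_n(d\omega)$

$= \frac{1}{Z_n(\beta, K)} \cdot \exp\left[ -n \beta \left( \frac{\sum_{j=1}^n \omega_j^2}{n} - K \left( \frac{\sum_{j=1}^n \omega_j}{n} \right)^2 \right) \right] P_n(d\omega)$

$= \frac{1}{Z_n(\beta, K)} \cdot \exp\left[ n \beta K \left( \frac{S_n(\omega)}{n} \right)^2 \right] \prod_{j=1}^n \exp(-\beta \omega_j^2) \, \rho(d\omega_j)$

$= \frac{[Z(\beta)]^n}{Z_n(\beta, K)} \cdot \exp\left[ n \beta K \left( \frac{S_n(\omega)}{n} \right)^2 \right] P_{n,\beta}(d\omega).$

In this formula
$Z(\beta) = \int_\Lambda \exp(-\beta \omega_j^2) \, \rho(d\omega_j)$ and
$P_{n,\beta}$ is the product measure on $\Lambda^n$ with identical
one-dimensional marginals $\rho_\beta$ defined in (3.1).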
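We define

$\tilde{Z}_n(\beta, K) = \int_{\Lambda^n} \exp\left[ n \beta K \left( \frac{S_n}{n} \right)^2 \right] dP_{n,\beta}.$

Since $P_{n,\beta,K}$ is a probability measure, it follows that

$\tilde{Z}_n(\beta, K) = \frac{Z_n(\beta, K)}{[Z(\beta)]^n}$

and, thus, that

(3.2)  $P_{n,\beta,K}(d\omega) = \frac{1}{\tilde{Z}_n(\beta, K)} \cdot \exp\left[ n \beta K \left( \frac{S_n(\omega)}{n} \right)^2 \right] P_{n,\beta}(d\omega).$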
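The normalization identity behind (3.2), namely $Z_n(\beta, K) = [Z(\beta)]^n \tilde{Z}_n(\beta, K)$, can be verified by brute-force enumeration for small $n$. The sketch below does this over all $3^n$ configurations; the function names and the parameter values are assumptions made for illustration.

```python
import math
from itertools import product

SPINS = (-1, 0, 1)


def Z_n(n, beta, K):
    """Partition function: integral of exp(-n*beta*h_{n,K}) against uniform P_n."""
    total = 0.0
    for omega in product(SPINS, repeat=n):
        h = sum(w * w for w in omega) / n - K * (sum(omega) / n) ** 2
        total += math.exp(-n * beta * h)
    return total / 3 ** n


def Z_tilde_n(n, beta, K):
    """Integral of exp(n*beta*K*(S_n/n)^2) against the product measure P_{n,beta}."""
    e = math.exp(-beta)
    site = {-1: e / (1 + 2 * e), 0: 1 / (1 + 2 * e), 1: e / (1 + 2 * e)}
    total = 0.0
    for omega in product(SPINS, repeat=n):
        weight = math.prod(site[w] for w in omega)
        total += weight * math.exp(n * beta * K * (sum(omega) / n) ** 2)
    return total


def Z_single(beta):
    """Z(beta): integral of exp(-beta*w^2) against the uniform measure rho."""
    return (1 + 2 * math.exp(-beta)) / 3


# Small illustrative system; the enumeration visits 3**6 = 729 configurations.
n, beta, K = 6, 1.3, 0.8
lhs = Z_n(n, beta, K)
rhs = Z_single(beta) ** n * Z_tilde_n(n, beta, K)
print(lhs, rhs)
```

The two numbers agree up to rounding, which reflects the fact that the factor $\exp(-\beta \sum_j \omega_j^2)$ has simply been moved from the Boltzmann weight into the single-site measure.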



By expressing the canonical ensemble in terms of the empirical means $S_n/n$,
we have reduced the BEG model to a Curie-Weiss-type model. Cramér's theorem
([9], Theorem II.4.1) states that, with respect to the product measures
$P_{n,\beta}$, $S_n/n$ satisfies the LDP on $[-1, 1]$ with rate function

(3.3)  $J_\beta(z) = \sup_{t \in \mathbb{R}} \{tz - c_\beta(t)\}.$

In this formula $c_\beta$ is the cumulant generating function defined by

(3.4)  $c_\beta(t) = \log \int_\Lambda \exp(t \omega_1) \, \rho_\beta(d\omega_1) = \log\left[ \frac{1 + e^{-\beta}(e^t + e^{-t})}{1 + 2e^{-\beta}} \right].$
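The closed form in (3.4) follows from summing $e^{t \omega_1}$ against the three atoms of $\rho_\beta$. The sketch below compares the term-by-term sum with the closed form; the function names are hypothetical and the sample values of $t$ and $\beta$ are arbitrary.

```python
import math


def rho_beta(beta):
    """Single-site measure (3.1) as weights on the spin values (-1, 0, 1)."""
    weights = (math.exp(-beta), 1.0, math.exp(-beta))
    total = sum(weights)
    return tuple(w / total for w in weights)


def c_beta_direct(t, beta):
    """Log moment generating function of rho_beta, summed term by term."""
    probs = rho_beta(beta)
    return math.log(sum(p * math.exp(t * s) for p, s in zip(probs, (-1, 0, 1))))


def c_beta_closed(t, beta):
    """Closed form (3.4)."""
    e = math.exp(-beta)
    return math.log((1.0 + e * (math.exp(t) + math.exp(-t))) / (1.0 + 2.0 * e))


for beta in (0.5, math.log(4.0), 2.0):
    for t in (-2.0, 0.0, 0.7, 3.0):
        print(beta, t, c_beta_direct(t, beta), c_beta_closed(t, beta))
```

Both functions vanish at $t = 0$ and are even in $t$, as expected for a symmetric single-site measure.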



$J_\beta$ is finite on the closed interval $[-1, 1]$ and is differentiable on
the open interval $(-1, 1)$. This function is expressed in (3.3) as the
Legendre-Fenchel transform of the finite, strictly convex, differentiable
function $c_\beta$. By the theory of these transforms ([22], Theorem 25.1,
[9], Theorem VI.5.3(d)), for each $z \in (-1, 1)$,

(3.5)  $J_\beta'(z) = (c_\beta')^{-1}(z).$

From the LDP for $S_n/n$ with respect to $P_{n,\beta}$, Theorem 2.4 in [10]
gives the LDP for $S_n/n$ with respect to the canonical ensemble written in the
form (3.2).

Theorem 3.3. With respect to the canonical ensemble $P_{n,\beta,K}$ written in
the form (3.2), the empirical means $S_n/n$ satisfy the LDP on $[-1, 1]$ with
rate function

(3.6)  $\tilde{I}_{\beta,K}(z) = J_\beta(z) - \beta K z^2 - \inf_{t \in \mathbb{R}} \{J_\beta(t) - \beta K t^2\}.$

In Section 2 the canonical ensemble for the BEG model is expressed in terms of
the empirical measures $L_n$. The corresponding set $\mathcal{E}_{\beta,K}$ of
canonical equilibrium macrostates is defined as the set of probability measures
$\nu \in \mathcal{P}(\Lambda)$ for which the rate function $I_{\beta,K}$ in the
associated LDP satisfies $I_{\beta,K}(\nu) = 0$ [see (2.7)]. By contrast, in
(3.2) the canonical ensemble is expressed in terms of the empirical means
$S_n/n$. We now consider the set $\tilde{\mathcal{E}}_{\beta,K}$ of canonical
equilibrium macrostates for the BEG model expressed in terms of the empirical
means. Theorem 3.3 makes it natural to define $\tilde{\mathcal{E}}_{\beta,K}$
as the set of $z \in [-1, 1]$ for which the rate function in that theorem
satisfies $\tilde{I}_{\beta,K}(z) = 0$. Since $z$ is a zero of this rate
function if and only if $z$ minimizes $J_\beta(z) - \beta K z^2$, we have

(3.7)  $\tilde{\mathcal{E}}_{\beta,K} = \{z \in [-1, 1] : z \text{ minimizes } J_\beta(z) - \beta K z^2\}.$

As we will see in Theorem 3.13, each $z \in \tilde{\mathcal{E}}_{\beta,K}$
equals the mean of a corresponding measure $\nu \in \mathcal{E}_{\beta,K}$.
Thus, each $z \in \tilde{\mathcal{E}}_{\beta,K}$ describes an equilibrium
configuration of the model in terms of the specific magnetization, or the
asymptotic average spin per site.

Although $J_\beta(z)$ can be computed explicitly, the expression is messy.
Instead, we use an alternative characterization of
$\tilde{\mathcal{E}}_{\beta,K}$ given in the next proposition to determine the
points in that set. This proposition is a special case of Theorem A.1 in [7].

Proposition 3.4. For $z \in \mathbb{R}$, define

(3.8)  $G_{\beta,K}(z) = \beta K z^2 - c_\beta(2 \beta K z).$

Then for each $\beta > 0$ and $K > 0$,

(3.9)  $\min_{|z| \leq 1} \{J_\beta(z) - \beta K z^2\} = \min_{z \in \mathbb{R}} \{G_{\beta,K}(z)\}.$

In addition, the global minimum points of $J_\beta(z) - \beta K z^2$ coincide
with the global minimum points of $G_{\beta,K}$. As a consequence,

(3.10)  $\tilde{\mathcal{E}}_{\beta,K} = \{z \in \mathbb{R} : z \text{ minimizes } G_{\beta,K}(z)\}.$

Proof. The finite, convex function $f(z) = c_\beta(2 \beta K z)/(2 \beta K)$
has the Legendre-Fenchel transform

$f^*(z) = \sup_{x \in \mathbb{R}} \{xz - f(x)\} = J_\beta(z)/(2 \beta K).$

We prove the proposition by showing the following three steps:

1. $\sup_{z \in \mathbb{R}} \{f(z) - z^2/2\} = \sup_{|z| \leq 1} \{z^2/2 - f^*(z)\}$.

2. Both suprema in step 1 are attained, the first for some
$z \in \mathbb{R}$ and the second for some $z \in (-1, 1)$.

3. The global maximum points of $f(z) - z^2/2$ coincide with the global maximum
points of $z^2/2 - f^*(z)$.

The proof uses three properties of Legendre-Fenchel transforms:

1. For all $z \in \mathbb{R}$, $f^{**}(z) = (f^*)^*(z)$ equals $f(z)$ ([9],
Theorem VI.5.3(e)).

2. If for some $x \in \mathbb{R}$ and $z \in \mathbb{R}$, we have
$z = f'(x)$, then $f(x) + f^*(z) = xz$ ([22], Theorem 25.1, [9],
Theorem VI.5.3(c)). In particular, if $z = x$, then $f(x) + f^*(x) = x^2$.

3. If there exists $x \in (-1, 1)$ and $y \in \mathbb{R}$ such that

(3.11)  $f^*(z) \geq f^*(x) + y(z - x)$ for all $z \in [-1, 1]$,

then $y = (f^*)'(x)$ ([22], Theorem 25.1). Hence, by properties 1 and 2,

$f^*(x) + f^{**}(y) = f^*(x) + f(y) = xy.$

In particular, if (3.11) is valid with $y = x$, then $f(x) + f^*(x) = x^2$.

Step 1 in the proof is a special case of Theorem C.1 in [8]. For completeness,
we present the straightforward proof. Let
$M = \sup_{z \in \mathbb{R}} \{f(z) - z^2/2\}$. Since for any $|z| \leq 1$ and
$x \in \mathbb{R}$

$f^*(z) + M \geq xz - f(x) + M \geq xz - x^2/2,$

we have

$f^*(z) + M \geq \sup_{x \in \mathbb{R}} \{xz - x^2/2\} = z^2/2.$

It follows that $M \geq z^2/2 - f^*(z)$ and thus that
$M \geq \sup_{|z| \leq 1} \{z^2/2 - f^*(z)\}$. To prove the reverse inequality,
let $N = \sup_{|z| \leq 1} \{z^2/2 - f^*(z)\}$. Then for any
$z \in \mathbb{R}$ and $|x| \leq 1$,

$z^2/2 + N \geq xz - x^2/2 + N \geq xz - f^*(x).$

Since $f^*(x) = \infty$ for $|x| > 1$, it follows from property 1 that

$z^2/2 + N \geq \sup_{|x| \leq 1} \{xz - f^*(x)\} = f(z)$

and thus that $N \geq \sup_{z \in \mathbb{R}} \{f(z) - z^2/2\}$. This completes
the proof of step 1.

Since $f(z) \sim |z|$ as $|z| \to \infty$, $f(z) - z^2/2$ attains its supremum
over $\mathbb{R}$. Since $z^2/2 - f^*(z)$ is continuous and
$\lim_{|z| \to 1} |(f^*)'(z)| = \infty$, $z^2/2 - f^*(z)$ attains its supremum
over $[-1, 1]$ in the open interval $(-1, 1)$. This completes the proof of
step 2.

We now prove that the global maximum points of the two functions coincide. Let
$x$ be any point in $\mathbb{R}$ at which $f(z) - z^2/2$ attains its supremum.
Then $x = f'(x)$, and so by the second assertion in property 2,
$f(x) + f^*(x) = x^2$. The point $x$ lies in $(-1, 1)$ because the range of
$f'(z) = c_\beta'(2 \beta K z)$ equals $(-1, 1)$. Step 1 now implies that

$\sup_{|z| \leq 1} \{z^2/2 - f^*(z)\} = \sup_{z \in \mathbb{R}} \{f(z) - z^2/2\}$
$= f(x) - x^2/2 = x^2/2 - f^*(x).$

We conclude that $z^2/2 - f^*(z)$ attains its supremum at $x \in (-1, 1)$.

Conversely, let $x$ be any point in $(-1, 1)$ at which $z^2/2 - f^*(z)$ attains
its supremum. Then for any $z \in [-1, 1]$,

$x^2/2 - f^*(x) \geq z^2/2 - f^*(z).$

It follows that, for any $z \in [-1, 1]$,

$f^*(z) \geq f^*(x) + (z^2 - x^2)/2 \geq f^*(x) + x(z - x).$

The second assertion in property 3 implies that $f^*(x) + f(x) = x^2$, and, in
conjunction with step 1, this in turn implies that

$\sup_{z \in \mathbb{R}} \{f(z) - z^2/2\} = \sup_{|z| \leq 1} \{z^2/2 - f^*(z)\}$
$= x^2/2 - f^*(x) = f(x) - x^2/2.$

We conclude that $f(z) - z^2/2$ attains its supremum at $x$. This completes
the proof of the proposition. □
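The equality of minima in (3.9) can be checked numerically. In the sketch below, $J_\beta$ is evaluated by solving $c_\beta'(t) = z$ with bisection, and both sides of (3.9) are minimized over finite grids. The grids, the bisection bracket and the parameter pairs are assumptions chosen for illustration, so agreement is only expected up to discretization error.

```python
import math


def c_beta(t, beta):
    """Cumulant generating function (3.4), written with cosh."""
    e = math.exp(-beta)
    return math.log((1 + 2 * e * math.cosh(t)) / (1 + 2 * e))


def c_beta_prime(t, beta):
    """Derivative of c_beta; increasing in t with range (-1, 1)."""
    e = math.exp(-beta)
    return 2 * e * math.sinh(t) / (1 + 2 * e * math.cosh(t))


def J_beta(z, beta):
    """Legendre-Fenchel transform (3.3) for |z| < 1 via bisection on c'_beta(t) = z."""
    lo, hi = -50.0, 50.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if c_beta_prime(mid, beta) < z:
            lo = mid
        else:
            hi = mid
    t = 0.5 * (lo + hi)
    return t * z - c_beta(t, beta)


def min_J_side(beta, K, steps=1000):
    """Grid minimum of J_beta(z) - beta*K*z^2 over the open interval (-1, 1)."""
    zs = [-1 + 2 * i / steps for i in range(1, steps)]
    return min(J_beta(z, beta) - beta * K * z * z for z in zs)


def min_G_side(beta, K, steps=2000, zmax=2.0):
    """Grid minimum of G_{beta,K}(z) over [-zmax, zmax]."""
    zs = [-zmax + 2 * zmax * i / steps for i in range(steps + 1)]
    return min(beta * K * z * z - c_beta(2 * beta * K * z, beta) for z in zs)


for beta, K in ((1.0, 0.5), (1.0, 1.5), (2.0, 1.2)):
    print(beta, K, min_J_side(beta, K), min_G_side(beta, K))
```

The practical advantage of the $G_{\beta,K}$ side is visible in the code: it needs only the explicit function $c_\beta$, whereas the $J_\beta$ side requires an inner root-finding step for every $z$.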

Proposition 3.4 states that $\tilde{\mathcal{E}}_{\beta,K}$ consists of the
global minimum points of
$G_{\beta,K}(z) = \beta K z^2 - c_\beta(2 \beta K z)$. In order to simplify the
minimization problem, we make the change of variables $z \to z/(2 \beta K)$ in
$G_{\beta,K}$, obtaining the new function

(3.12)  $F_{\beta,K}(w) = \frac{w^2}{4 \beta K} - c_\beta(w).$

Proposition 3.4 gives the alternative characterization of
$\tilde{\mathcal{E}}_{\beta,K}$ to be

(3.13)  $\tilde{\mathcal{E}}_{\beta,K} = \left\{ \frac{w}{2 \beta K} : w \text{ minimizes } F_{\beta,K}(w) \right\}.$

We use $F_{\beta,K}$ to analyze $\tilde{\mathcal{E}}_{\beta,K}$ because the
second term of $F_{\beta,K}$ contains only the parameter $\beta$, while both
terms in $G_{\beta,K}$ contain both parameters $\beta$ and $K$. In order to
analyze the structure of $\tilde{\mathcal{E}}_{\beta,K}$, we take advantage of
the simpler form of $F_{\beta,K}$ by fixing $\beta$ and varying $K$. This
innovation makes the analysis of $\tilde{\mathcal{E}}_{\beta,K}$ much more
efficient than in previous studies. Our goal is to prove that the elements of
$\tilde{\mathcal{E}}_{\beta,K}$ change continuously with $K$ for all
$0 < \beta \leq \beta_c = \log 4$ (Theorem 3.1) and have a discontinuity at
$K_c^{(1)}(\beta)$ for all $\beta > \beta_c$ (Theorem 3.2).
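The contrast between the two regimes can be seen numerically by minimizing the function $w \mapsto w^2/(4 \beta K) - c_\beta(w)$ on a grid for fixed $\beta$ and several values of $K$, and then converting the minimizer $w$ to $z = w/(2 \beta K)$. The sketch below is a crude grid search, not the analysis of the paper; the grid, the sample values of $\beta$ and $K$, and the function names are assumptions.

```python
import math


def F(w, beta, K):
    """F_{beta,K}(w) = w^2 / (4*beta*K) - c_beta(w)."""
    e = math.exp(-beta)
    c = math.log((1 + 2 * e * math.cosh(w)) / (1 + 2 * e))
    return w * w / (4 * beta * K) - c


def positive_minimizer(beta, K, wmax=20.0, steps=20000):
    """Grid minimizer of F over [0, wmax]; F is even, so w >= 0 suffices."""
    grid = [wmax * i / steps for i in range(steps + 1)]
    return min(grid, key=lambda w: F(w, beta, K))


def magnetization_curve(beta, Ks):
    """z = w / (2*beta*K) for each K, with w the grid minimizer of F."""
    return [positive_minimizer(beta, K) / (2 * beta * K) for K in Ks]


# beta = 1 < log 4: the nonzero minimizer emerges continuously from z = 0.
z_sub = magnetization_curve(1.0, [1.1, 1.1806])
# beta = 3 > log 4: the minimizer jumps from z = 0 to a value close to 1.
z_super = magnetization_curve(3.0, [0.95, 1.1])
print(z_sub, z_super)
```

With these illustrative values, the minimizer for $\beta = 1$ is still small just above the bifurcation point, whereas for $\beta = 3$ it is already close to 1 shortly after the minimizer leaves zero.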

In order to determine the minimum points of $F_{\beta,K}$ and, thus, the points
in $\tilde{\mathcal{E}}_{\beta,K}$, we study the derivative

(3.14)  $F_{\beta,K}'(w) = \frac{w}{2 \beta K} - c_\beta'(w).$

$F_{\beta,K}'(w)$ consists of a linear part $w/(2 \beta K)$ and a nonlinear
part $c_\beta'(w)$. As we will see in Sections 3.2 and 3.3, the basic mechanism
underlying the change in the bifurcation behavior of
$\tilde{\mathcal{E}}_{\beta,K}$ is the change in the concavity behavior of
$c_\beta'(w)$ for $0 < \beta \leq \beta_c$ versus $\beta > \beta_c$, which is
the subject of the next theorem. A related phenomenon was observed in [11],
Theorem 1.2(b), and in [13], Theorem 4, in the context of work on the
Griffiths-Hurst-Sherman correlation inequality for models of ferromagnets; this
inequality is used to show the concavity of the specific magnetization as a
function of the external field.

Theorem 3.5. For $\beta > \beta_c = \log 4$, define

(3.15)  $w_c(\beta) = \cosh^{-1}\left(\tfrac{1}{2} e^{\beta} - 4e^{-\beta}\right) > 0.$

The following conclusions hold:

(a) For $0 < \beta \le \beta_c$, $c'_\beta(w)$ is strictly concave for $w > 0$.

(b) For $\beta > \beta_c$, $c'_\beta(w)$ is strictly convex for $0 < w < w_c(\beta)$ and $c'_\beta(w)$ is
strictly concave for $w > w_c(\beta)$.
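The statement of the theorem can be probed numerically before reading the proof. The sketch below is an informal sanity check, not part of the paper: it evaluates the right-hand side of (3.16) on a grid and checks its sign on either side of $w_c(\beta)$ from (3.15). The function names `c3` and `w_c`, the grid, and the sample values $\beta = 1$ and $\beta = 4$ are illustrative choices.

```python
import math

def c3(beta, w):
    """Third derivative of c_beta, using the right-hand side of (3.16)."""
    a = math.exp(-beta)
    num = 2.0 * a * math.sinh(w) * (1.0 - 2.0 * a * math.cosh(w) - 8.0 * a * a)
    return num / (1.0 + 2.0 * a * math.cosh(w)) ** 3

def w_c(beta):
    """Inflection point (3.15); meaningful only for beta > log 4."""
    return math.acosh(0.5 * math.exp(beta) - 4.0 * math.exp(-beta))

beta_low, beta_high = 1.0, 4.0           # one value below log 4, one above
grid = [0.05 * k for k in range(1, 200)]  # sample points w in (0, 10)

# Part (a): c''' < 0 on the grid, so c' is concave there.
concave_low = all(c3(beta_low, w) < 0 for w in grid)

# Part (b): c''' > 0 below w_c and c''' < 0 above w_c.
wc = w_c(beta_high)
convex_below = all(c3(beta_high, w) > 0 for w in grid if w < wc)
concave_above = all(c3(beta_high, w) < 0 for w in grid if w > wc)
print(concave_low, wc, convex_below, concave_above)
```

A grid check of this kind only illustrates the sign pattern; the proof below establishes it for all $w > 0$.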

Proof. (a) We show that for all $0 < \beta \le \beta_c$, $c'''_\beta(w) < 0$ for all $w > 0$. A
short calculation yields

(3.16)  $c'''_\beta(w) = \frac{[2e^{-\beta} \sinh w]\,[1 - 2e^{-\beta} \cosh w - 8e^{-2\beta}]}{[1 + 2e^{-\beta} \cosh w]^3}.$

Since $2e^{-\beta} \sinh w$ and $1 + 2e^{-\beta} \cosh w$ are positive for $w > 0$, $c'''_\beta(w) < 0$ for
$w > 0$ if and only if

$1 - 2e^{-\beta} \cosh w - 8e^{-2\beta} < 0$ for $w > 0$.

The inequality $\cosh w > 1$ for $w > 0$ implies that

$[1 - 2e^{-\beta} \cosh w - 8e^{-2\beta}] < [1 - 2e^{-\beta} - 8e^{-2\beta}] = (1 - 4e^{-\beta})(1 + 2e^{-\beta})$ for all $w > 0$.

Since $1 - 4e^{-\beta} \le 0$ when $\beta \le \log 4$, it follows that, for all $0 < \beta \le \log 4$,
$c'''_\beta(w) < 0$ for $w > 0$.

(b) Fixing $\beta > \beta_c$, we determine the critical value $w_c(\beta)$ such that $c'_\beta(w)$
is strictly convex for $0 < w < w_c(\beta)$ and strictly concave for $w > w_c(\beta)$.
From the expression for $c'''_\beta(w)$ in (3.16), $c'''_\beta(w) > 0$ for $w > 0$ if and only if
$(1 - 2e^{-\beta} \cosh w - 8e^{-2\beta}) > 0$ for $w > 0$. Therefore, $c'_\beta(w)$ is strictly convex
for

$0 < w < \cosh^{-1}\left(\tfrac{1}{2} e^{\beta} - 4e^{-\beta}\right).$

On the other hand, since $c'''_\beta(w) < 0$ for $w > 0$ if and only if
$(1 - 2e^{-\beta} \cosh w - 8e^{-2\beta}) < 0$ for $w > 0$, we conclude that $c'_\beta(w)$ is strictly concave for

$w > \cosh^{-1}\left(\tfrac{1}{2} e^{\beta} - 4e^{-\beta}\right).$

This completes the proof of part (b). □

The concavity description of $c'_\beta$ stated in Theorem 3.5 allows us to find
the global minimum points of $F_{\beta,K}$ and thus the points in $\tilde{\mathcal{E}}_{\beta,K}$ for all values
of the parameters $\beta$ and $K$. We carry this out in the next two sections, first
for $0 < \beta \le \beta_c$ and then for $\beta > \beta_c$. In Section 3.4 we use this information
to give the structure of the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates
defined in (2.7).

3.2. Description of $\tilde{\mathcal{E}}_{\beta,K}$ for $0 < \beta \le \beta_c$. In Theorem 3.1 we state the
structure of the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates for the BEG
model with respect to the empirical measures when $0 < \beta \le \beta_c = \log 4$. The
main theorem in this section, Theorem 3.6, does the same for the set $\tilde{\mathcal{E}}_{\beta,K}$,
which has been shown to have the alternative characterization

(3.17)  $\tilde{\mathcal{E}}_{\beta,K} = \left\{ \frac{w}{2\beta K} \in \mathbb{R} : w \text{ minimizes } F_{\beta,K}(w) \right\}.$

We recall that $F_{\beta,K}(w) = w^2/(4\beta K) - c_\beta(w)$, where $c_\beta$ is defined in (3.4).
In Section 3.4 we will prove that there exists a one-to-one correspondence
between $\tilde{\mathcal{E}}_{\beta,K}$ and $\mathcal{E}_{\beta,K}$. In Section 3.5 we will use this fact to fully describe
the latter set for all $0 < \beta \le \beta_c$ and $K > 0$.

According to part (a) of Theorem 3.5, for $0 < \beta \le \beta_c$, $c'_\beta(w)$ is strictly
concave for $w > 0$. As a result, the study of $\tilde{\mathcal{E}}_{\beta,K}$ is similar to the study
of the equilibrium macrostates for the classical Curie-Weiss model as given
in Section IV.4 of [9]. Following the discussion in that section, we first use
a graphical argument to motivate the continuous bifurcation exhibited by
$\tilde{\mathcal{E}}_{\beta,K}$ for $0 < \beta \le \beta_c$. A detailed statement is given in Theorem 3.6.

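Minimum points of $F_{\beta,K}$ satisfy $F'_{\beta,K}(w) = 0$, which can be rewritten as

(3.18)  $c'_\beta(w) = \frac{w}{2\beta K}.$

Since the slope of the function $w \mapsto w/(2\beta K)$ is $1/(2\beta K)$, the nature of the
solutions of (3.18) depends on whether

$\frac{1}{2\beta K} \ge c''_\beta(0)$  or  $0 < \frac{1}{2\beta K} < c''_\beta(0)$.

This motivates the definition of the critical value

(3.19)  $K_c^{(2)}(\beta) = \frac{1}{2\beta c''_\beta(0)} = \frac{1}{4\beta e^{-\beta}} + \frac{1}{2\beta}.$

We use the same notation here as for the critical value in Theorem 3.1
because, as we will later prove, the continuous bifurcation in $K$ exhibited
by both sets $\mathcal{E}_{\beta,K}$ and $\tilde{\mathcal{E}}_{\beta,K}$ occurs at the same value $K_c^{(2)}(\beta)$.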
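The two expressions for the critical value in (3.19) can be compared numerically. This is a minimal sketch under the assumption that $c''_\beta(w) = (2e^{-\beta}\cosh w + 4e^{-2\beta})/(1 + 2e^{-\beta}\cosh w)^2$, which is the formula implied by the expression for $w c''_\beta(w)$ used later in the proof of Lemma 3.9; the names `c2` and `K_c2` are hypothetical.

```python
import math

def c2(beta, w):
    # ASSUMPTION: second derivative of c_beta, consistent with the formula
    # for w * c''_beta(w) in the proof of Lemma 3.9.
    a = math.exp(-beta)
    return (2.0 * a * math.cosh(w) + 4.0 * a * a) / (1.0 + 2.0 * a * math.cosh(w)) ** 2

def K_c2(beta):
    # Closed form on the right-hand side of (3.19).
    return 1.0 / (4.0 * beta * math.exp(-beta)) + 1.0 / (2.0 * beta)

rows = []
for beta in (0.5, 1.0, math.log(4.0)):
    slope_form = 1.0 / (2.0 * beta * c2(beta, 0.0))  # 1 / (2 beta c''(0))
    rows.append((beta, slope_form, K_c2(beta)))
    print(beta, slope_form, K_c2(beta))
```

For each sampled $\beta$ the two columns should coincide up to rounding, which reflects the identity $c''_\beta(0) = 2e^{-\beta}/(1 + 2e^{-\beta})$.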

We illustrate the minimum points of $F_{\beta,K}$ graphically in Figure 1 for
$\beta = 1$. For three ranges of values of $K$, this figure depicts the two components
of $F'_{\beta,K}$: the linear component $w/(2\beta K)$ and the nonlinear component $c'_\beta(w)$.

Figure 1(a) corresponds to $0 < K < K_c^{(2)}(\beta)$. Since $c''_\beta(0) = 1/(2\beta K_c^{(2)}(\beta))$,
for $0 < K < K_c^{(2)}(\beta)$, the two components of $F'_{\beta,K}$ intersect at only the origin,
and, thus, $F_{\beta,K}$ has a unique global minimum point at $w = 0$. Figure 1(b)
corresponds to $K = K_c^{(2)}(\beta)$. In this case the two components of
$F'_{\beta,K}$ are tangent at the origin, and again $F_{\beta,K}$ has a unique global minimum
point at $w = 0$. Figure 1(c) corresponds to $K > K_c^{(2)}(\beta)$. For such $K$, the
global minimum points of $F_{\beta,K}$ are symmetric nonzero points $w = \pm w(\beta, K)$,
$w(\beta, K) > 0$.

Figures 1(a) and 1(c) give similar information as Figures IV.3(b) and IV.3(d)
in [9], which depict the phase transition in the Curie-Weiss model. In these
two sets of figures the functions being graphed are Legendre-Fenchel
transforms of each other.

The graphical information just obtained concerning the global minimum
points of $F_{\beta,K}$ for $0 < \beta \le \beta_c$ motivates the form of $\tilde{\mathcal{E}}_{\beta,K}$ stated in the next
theorem. The positive quantity $z(\beta, K)$ equals $w(\beta, K)/(2\beta K)$; $w(\beta, K)$ is
the unique positive global minimum point of $F_{\beta,K}$ for $K > K_c^{(2)}(\beta)$, the
existence of which is proved in Lemma 3.7. According to part (c) of the
theorem, $z(\beta, K)$ is a continuous function for $K > K_c^{(2)}(\beta)$, and as
$K \to (K_c^{(2)}(\beta))^+$, $z(\beta, K)$ converges to 0. As a result, the bifurcation exhibited by
$\tilde{\mathcal{E}}_{\beta,K}$ at $K_c^{(2)}(\beta)$ is continuous.

Fig. 1. Continuous bifurcation for $\beta = 1$. (a) $K < K_c^{(2)}(\beta)$, (b) $K = K_c^{(2)}(\beta)$, (c) $K > K_c^{(2)}(\beta)$.

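Theorem 3.6. Define $\tilde{\mathcal{E}}_{\beta,K}$ by (3.7); equivalently,

$\tilde{\mathcal{E}}_{\beta,K} = \left\{ \frac{w}{2\beta K} \in \mathbb{R} : w \text{ minimizes } F_{\beta,K}(w) \right\}.$

For all $0 < \beta \le \beta_c$, the critical value $K_c^{(2)}(\beta) = 1/(2\beta c''_\beta(0))$ has the following
properties:

(a) For $0 < K \le K_c^{(2)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$.

(b) For $K > K_c^{(2)}(\beta)$, there exists a positive number $z(\beta, K)$ such that
$\tilde{\mathcal{E}}_{\beta,K} = \{\pm z(\beta, K)\}$.

(c) $z(\beta, K)$ is a strictly increasing continuous function for $K > K_c^{(2)}(\beta)$,
and as $K \to (K_c^{(2)}(\beta))^+$, $z(\beta, K) \to 0$. Therefore, $\tilde{\mathcal{E}}_{\beta,K}$ exhibits a continuous
bifurcation at $K_c^{(2)}(\beta)$.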

The proof of the theorem depends on the next lemma, in which we show
that, for $K > K_c^{(2)}(\beta)$, $F_{\beta,K}$ has a unique positive global minimum at a
point $w(\beta, K)$.

Lemma 3.7. For $0 < \beta \le \beta_c = \log 4$, define $F_{\beta,K}$ by (3.12). The following
conclusions hold:

(a) For each $K > K_c^{(2)}(\beta)$, $F_{\beta,K}$ has a critical point $w(\beta, K) > 0$ satisfying

$F'_{\beta,K}(w(\beta, K)) = 0$ and $F''_{\beta,K}(w(\beta, K)) > 0$.

(b) For each $K > K_c^{(2)}(\beta)$, $F_{\beta,K}$ has unique nonzero global minimum
points at $w = \pm w(\beta, K)$.

(c) The points $\{w(\beta, K), K > K_c^{(2)}(\beta)\}$ span the positive real line; that is,
for each $x > 0$, there exists $K > K_c^{(2)}(\beta)$ such that $x = w(\beta, K)$.
Proof. (a) For any $K > K_c^{(2)}(\beta)$, we have

(3.20)  $\frac{1}{2\beta K} < \frac{1}{2\beta K_c^{(2)}(\beta)} = c''_\beta(0) = \lim_{w \to 0} \frac{c'_\beta(w)}{w}.$

Since $c'_\beta$ is continuous, for sufficiently small $w > 0$, we have $w/(2\beta K) <
c'_\beta(w)$ and, thus, $F'_{\beta,K}(w) < 0$. On the other hand, $|c'_\beta(w)| < 1$ for all $w$ and,
therefore, $\lim_{w \to \infty} F'_{\beta,K}(w) = \infty$. It follows that $F'_{\beta,K}(w) > 0$ for sufficiently
large $w > 0$. Consequently, by the continuity of $F'_{\beta,K}$, there exists at least one
positive critical point of $F_{\beta,K}$; the analyticity of $F_{\beta,K}$ implies that $F_{\beta,K}$ has
at most finitely many critical points. Denote by $w(\beta, K) > 0$ the smallest
positive critical point of $F_{\beta,K}$.

We now prove that $F''_{\beta,K}(w(\beta, K)) > 0$. Since $F'_{\beta,K}(w(\beta, K)) = 0$, the mean
value theorem yields the existence of $\alpha \in (0, w(\beta, K))$ such that

$c''_\beta(\alpha) = \frac{c'_\beta(w(\beta, K))}{w(\beta, K)} = \frac{1}{2\beta K}.$

By part (a) of Theorem 3.5, since $\alpha < w(\beta, K)$, it follows that $c''_\beta(\alpha) >
c''_\beta(w(\beta, K))$ and thus that

(3.21)  $F''_{\beta,K}(w(\beta, K)) = \frac{1}{2\beta K} - c''_\beta(w(\beta, K)) > \frac{1}{2\beta K} - c''_\beta(\alpha) = 0.$

This completes the proof of part (a).

(b) For any $w > w(\beta, K)$, the strict concavity of $c'_\beta(w)$ for $w > 0$
[Theorem 3.5(a)] implies that $c''_\beta(w) < c''_\beta(w(\beta, K))$. Therefore, by (3.21), we have

$F''_{\beta,K}(w) = \frac{1}{2\beta K} - c''_\beta(w) > \frac{1}{2\beta K} - c''_\beta(w(\beta, K)) = F''_{\beta,K}(w(\beta, K)) > 0.$

Thus, $F'_{\beta,K}$ is strictly increasing for $w > w(\beta, K)$. This property allows us to
conclude that $w(\beta, K)$ is the unique positive critical point and the unique
positive local minimum point of $F_{\beta,K}$. By symmetry, $F_{\beta,K}$ has a unique
negative local minimum point at $w = -w(\beta, K)$. In addition, as shown
in (3.20), for any $K > K_c^{(2)}(\beta)$, we have $F''_{\beta,K}(0) = 1/(2\beta K) - c''_\beta(0) < 0$. Since
$\lim_{|w| \to \infty} F_{\beta,K}(w) = \infty$, we conclude that $\pm w(\beta, K)$ are the unique global
minimum points of $F_{\beta,K}$.

(c) Given $x > 0$, define the positive number $K_x = x/(2\beta c'_\beta(x))$. Then

$F'_{\beta,K_x}(x) = \frac{x}{2\beta K_x} - c'_\beta(x) = 0.$

Since $c'_\beta(w)$ is strictly concave for $w > 0$, we have $c''_\beta(0) > c'_\beta(x)/x$, and,
therefore,

$K_x = \frac{x}{2\beta c'_\beta(x)} > \frac{1}{2\beta c''_\beta(0)} = K_c^{(2)}(\beta).$

It follows that $x$ is a positive critical point of $F_{\beta,K}$ for $K = K_x > K_c^{(2)}(\beta)$; by
the uniqueness of the positive critical point, $x = w(\beta, K_x)$. This completes
the proof that the points $\{w(\beta, K), K > K_c^{(2)}(\beta)\}$ span the positive real line.
□
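Part (c) of the lemma is constructive, so it can be illustrated directly. The following sketch is an informal check rather than part of the proof: for a few sample values of $x$ it forms $K_x = x/(2\beta c'_\beta(x))$ and verifies that $K_x$ exceeds $K_c^{(2)}(\beta)$ and that $x$ is a critical point of $F_{\beta,K_x}$. It assumes $c'_\beta(w) = 2e^{-\beta}\sinh w/(1 + 2e^{-\beta}\cosh w)$, which is consistent with (3.16) but not restated in this section; the names `c1` and `K_c2` are hypothetical.

```python
import math

def c1(beta, w):
    # ASSUMPTION: c'_beta(w) = 2 e^{-beta} sinh w / (1 + 2 e^{-beta} cosh w).
    a = math.exp(-beta)
    return 2.0 * a * math.sinh(w) / (1.0 + 2.0 * a * math.cosh(w))

def K_c2(beta):
    # Closed form (3.19) for the critical value.
    return 1.0 / (4.0 * beta * math.exp(-beta)) + 1.0 / (2.0 * beta)

beta = 1.0  # any value with 0 < beta <= log 4
rows = []
for x in (0.1, 1.0, 3.0, 8.0):
    K_x = x / (2.0 * beta * c1(beta, x))            # definition in part (c)
    residual = x / (2.0 * beta * K_x) - c1(beta, x)  # F'_{beta,K_x}(x)
    rows.append((x, K_x, residual))
    print(x, K_x, K_x > K_c2(beta), residual)
```

As $x$ decreases toward 0 the computed $K_x$ approaches $K_c^{(2)}(\beta)$ from above, which is the mechanism behind the limit in part (c) of Theorem 3.6.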

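Proof of Theorem 3.6. (a) For $0 < K \le K_c^{(2)}(\beta)$, $F'_{\beta,K}(0) = 0$, and,
thus, $w = 0$ is a critical point of $F_{\beta,K}$. We prove that $w = 0$ is the unique
global minimum point of $F_{\beta,K}$ by showing that, for $w > 0$, $F'_{\beta,K}(w) > 0$ and
for $w < 0$, $F'_{\beta,K}(w) < 0$. Since $c'_\beta(w)$ is strictly concave for $w > 0$
[Theorem 3.5(a)], for any $w > 0$, we have $c''_\beta(0) > c'_\beta(w)/w$. As a result, for all
$w > 0$ and all $0 < K \le K_c^{(2)}(\beta) = 1/(2\beta c''_\beta(0))$,

$F'_{\beta,K}(w) = \frac{w}{2\beta K} - c'_\beta(w) > \frac{w}{2\beta K} - w\,c''_\beta(0) \ge 0.$

On the other hand, since $F'_{\beta,K}$ is an odd function, $F'_{\beta,K}(w) < 0$ for all $w < 0$.
Therefore, $w = 0$ is the unique global minimum point of $F_{\beta,K}$. It follows
that, for $0 < K \le K_c^{(2)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$.

(b) For $K > K_c^{(2)}(\beta)$, let $w(\beta, K)$ be the unique positive global minimum
point of $F_{\beta,K}$, the existence of which is proved in part (a) of Lemma 3.7, and
define $z(\beta, K) = w(\beta, K)/(2\beta K)$. It follows that, for $K > K_c^{(2)}(\beta)$,
$\tilde{\mathcal{E}}_{\beta,K} = \{\pm z(\beta, K)\}$.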

(c) By part (a) of Lemma 3.7,

$F'_{\beta,K}(w(\beta, K)) = 0$ and $F''_{\beta,K}(w(\beta, K)) > 0$.

The implicit function theorem implies that, for $K > K_c^{(2)}(\beta)$, $w(\beta, K)$ and,
thus, $z(\beta, K)$ are continuously differentiable functions of $K$ and are thus
continuous. Straightforward calculations yield

$\frac{\partial w(\beta, K)}{\partial K} = \frac{w(\beta, K)}{2\beta K^2 F''_{\beta,K}(w(\beta, K))}$

and

$\frac{\partial z(\beta, K)}{\partial K} = \frac{2\beta w(\beta, K)}{(2\beta K)^2} \left( \frac{c''_\beta(w(\beta, K))}{F''_{\beta,K}(w(\beta, K))} \right).$

Since $w(\beta, K)$ is positive and both $c''_\beta(w(\beta, K)) > 0$ and $F''_{\beta,K}(w(\beta, K)) > 0$,
$w(\beta, K)$ and $z(\beta, K)$ are strictly increasing functions for $K > K_c^{(2)}(\beta)$.

As $K \downarrow K_c^{(2)}(\beta)$, $w(\beta, K) > 0$, $w(\beta, K)$ is strictly decreasing, and the
points $\{w(\beta, K), K > K_c^{(2)}(\beta)\}$ span the positive real line [Lemma 3.7(c)].
We conclude that $\lim_{K \to (K_c^{(2)}(\beta))^+} w(\beta, K) = 0$ and thus that
$\lim_{K \to (K_c^{(2)}(\beta))^+} z(\beta, K) = 0$. This completes the proof of the theorem. □
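The continuous bifurcation described in Theorem 3.6 can be observed numerically. The sketch below is an illustration only: it solves (3.18) for the positive root by bisection at $\beta = 1$ and prints $z(\beta, K) = w(\beta, K)/(2\beta K)$ for values of $K$ approaching $K_c^{(2)}(\beta)$ from above. It assumes the explicit form of $c'_\beta$ noted in the code comment; the helper names, the bracket $[10^{-9}, 4\beta K + 1]$ and the iteration count are arbitrary choices.

```python
import math

def c1(beta, w):
    # ASSUMPTION: c'_beta(w) = 2 e^{-beta} sinh w / (1 + 2 e^{-beta} cosh w).
    a = math.exp(-beta)
    return 2.0 * a * math.sinh(w) / (1.0 + 2.0 * a * math.cosh(w))

def K_c2(beta):
    # Closed form (3.19).
    return 1.0 / (4.0 * beta * math.exp(-beta)) + 1.0 / (2.0 * beta)

def w_pos(beta, K, iters=200):
    """Positive root of F'_{beta,K}(w) = w/(2 beta K) - c'_beta(w).

    Intended for 0 < beta <= log 4 and K > K_c2(beta), where Lemma 3.7
    gives a unique positive root with F' < 0 to its left and F' > 0 to its right.
    """
    f = lambda w: w / (2.0 * beta * K) - c1(beta, w)
    lo, hi = 1e-9, 4.0 * beta * K + 1.0   # f(hi) > 0 because |c'| < 1
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

beta = 1.0
Kc = K_c2(beta)
offsets = (1e-4, 1e-3, 1e-2, 1e-1, 1.0)
zs = [w_pos(beta, Kc + d) / (2.0 * beta * (Kc + d)) for d in offsets]
print(zs)  # increasing in K and small when K is close to Kc
```

The printed values should increase with $K$ and shrink toward 0 as the offset from $K_c^{(2)}(\beta)$ shrinks, in line with part (c).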

Theorem 3.6 describes the continuous bifurcation exhibited by $\tilde{\mathcal{E}}_{\beta,K}$ for
$0 < \beta \le \beta_c$. Theorem 3.8 in the next section describes the discontinuous
bifurcation exhibited by $\tilde{\mathcal{E}}_{\beta,K}$ for $\beta$ in the complementary region $\beta > \beta_c$.

3.3. Description of $\tilde{\mathcal{E}}_{\beta,K}$ for $\beta > \beta_c$. In Theorem 3.2 we state the
structure of the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates for the BEG model
with respect to the empirical measures when $\beta > \beta_c$. The main theorem in
this subsection, Theorem 3.8, does the same for the set $\tilde{\mathcal{E}}_{\beta,K}$, which has been
shown to have the alternative characterization

(3.22)  $\tilde{\mathcal{E}}_{\beta,K} = \left\{ \frac{w}{2\beta K} \in \mathbb{R} : w \text{ minimizes } F_{\beta,K}(w) \right\}.$

As in Section 3.2, $F_{\beta,K}(w) = w^2/(4\beta K) - c_\beta(w)$, where $c_\beta$ is defined in (3.4).
In Section 3.4 we will prove that there exists a one-to-one correspondence
between $\tilde{\mathcal{E}}_{\beta,K}$ and $\mathcal{E}_{\beta,K}$. In Section 3.5 we will use this fact to fully describe
the latter set for all $\beta > \beta_c$ and $K > 0$.

Minimum points of $F_{\beta,K}$ satisfy the equation

(3.23)  $F'_{\beta,K}(w) = \frac{w}{2\beta K} - c'_\beta(w) = 0.$

In contrast to the previous section, where for $0 < \beta \le \beta_c$, $c'_\beta(w)$ is strictly
concave for $w > 0$, part (b) of Theorem 3.5 states that, for $\beta > \beta_c$, there
exists $w_c(\beta) > 0$ such that $c'_\beta(w)$ is strictly convex for $w \in (0, w_c(\beta))$ and
strictly concave for $w > w_c(\beta)$. As a result, for $\beta > \beta_c$, we are no longer in
the situation of the classical Curie-Weiss model for which the bifurcation
with respect to $K$ is continuous. Instead, for $\beta > \beta_c$, as $K$ increases through
the critical value $K_c^{(1)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K}$ exhibits a discontinuous bifurcation.

While the discontinuous bifurcation exhibited by $\tilde{\mathcal{E}}_{\beta,K}$ for $\beta > \beta_c$ is easily
observed graphically, the full analytic proof is more complicated than in the
case $0 < \beta \le \beta_c$. As in the previous subsection, we will first motivate this
discontinuous bifurcation via a graphical argument. A detailed statement is
given in Theorem 3.8.

For $\beta > \beta_c$, we divide the range of the positive parameter $K$ into three
intervals separated by the values $K_1 = K_1(\beta)$ and $K_2 = K_2(\beta)$. $K_1$ is
defined to be the unique value of $K$ such that the line $w/(2\beta K)$ is tangent
to the curve $c'_\beta$ at a point $w_1 = w_1(\beta) > 0$. The existence and uniqueness of
$K_1$ and $w_1$ are proved in Lemma 3.9. $K_2$ is defined to be the value of $K$
such that the slopes of the line $w/(2\beta K)$ and the curve $c'_\beta$ at $w = 0$ agree.
Specifically,

(3.24)  $K_2 = \frac{1}{2\beta c''_\beta(0)} = \frac{1}{4\beta e^{-\beta}} + \frac{1}{2\beta}.$
Figure 2 represents graphically the values of $K_1$ and $K_2$ for $\beta = 4$, showing
that $K_1 < K_2$. In Lemma 3.9 it is proved that this inequality holds for all
$\beta > \beta_c$.

In each of Figures 3-7, for fixed $\beta > \beta_c$ and for different ranges of values
of $K > 0$, the first graph (a) depicts the two components of $F'_{\beta,K}$: the linear
component $w/(2\beta K)$ and the nonlinear component $c'_\beta$. The second graph
(b) shows the corresponding graph of $F_{\beta,K}$. In these figures the following
values of $\beta$ were used: $\beta = 4$ in Figures 3, 5, 6, 7 and $\beta = 2.8$ in Figure 4.

As we see in Figure 3, for $0 < K < K_1$, the linear component intersects
the nonlinear component at only the origin and, thus, $F_{\beta,K}$ has a unique
global minimum point at $w = 0$. Since $\beta$ is fixed, the graph of the nonlinear
component $c'_\beta$ also remains fixed. As $K$ increases, the slope of the linear
component $w/(2\beta K)$ decreases, leading to the discontinuous bifurcation in
$\tilde{\mathcal{E}}_{\beta,K}$ with respect to $K$.

Fig. 2. Graphical representation of the values $K_1$ and $K_2$ for $\beta = 4$.

Fig. 6. (a) Graph of two components of $F'_{\beta,K}$ and (b) graph of $F_{\beta,K}$ for
$K_c^{(1)}(\beta) < K < K_2$.

The graph of $F_{\beta,K}$ is depicted in Figure 4 for $K > K_2$. We see that
$F_{\beta,K}$ has two global minimum points at $w = \pm w(\beta, K)$, where $w(\beta, K)$ is
positive. Therefore, for $0 < K < K_1$, we have $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$ and for $K > K_2$,
we have $\tilde{\mathcal{E}}_{\beta,K} = \{\pm z(\beta, K)\}$, where $z(\beta, K) = w(\beta, K)/(2\beta K)$ is positive.
Fig. 7. (a) Graph of two components of $F'_{\beta,K}$ and (b) graph of $F_{\beta,K}$ for $K = K_c^{(1)}(\beta)$.

Now suppose that $K \in (K_1, K_2)$. In this region there exists $w(\beta, K) > 0$
such that $F_{\beta,K}$ has three local minimum points at $w = 0$ and $w = \pm w(\beta, K)$.
As we see in Figure 5, for $K > K_1$ but sufficiently close to $K_1$, $F_{\beta,K}(0) <
F_{\beta,K}(w(\beta, K))$; as a result, the unique global minimum point of $F_{\beta,K}$ is
$w = 0$. On the other hand, we see in Figure 6 that, for $K < K_2$ but
sufficiently close to $K_2$, $F_{\beta,K}(0) > F_{\beta,K}(w(\beta, K))$; as a result, the global
minimum points of $F_{\beta,K}$ are $w = \pm w(\beta, K)$. As $K$ increases over the
interval $(K_1, K_2)$, $F_{\beta,K}(w(\beta, K))$ decreases continuously (Lemma 3.12).
Consequently, as Figure 7 reveals, there exists a critical value $K_c^{(1)}(\beta)$ such that,
for $K = K_c^{(1)}(\beta)$, $F_{\beta,K}(0) = F_{\beta,K}(w(\beta, K))$; as a result, the global minimum points
are $w = 0$ and $w = \pm w(\beta, K)$.

We use the same notation $K_c^{(1)}(\beta)$ as for the critical value in Theorem 3.2.
As we will later prove, the discontinuous bifurcation in $K$ exhibited by both
sets $\mathcal{E}_{\beta,K}$ and $\tilde{\mathcal{E}}_{\beta,K}$ occurs at the same point $K_c^{(1)}(\beta)$.

The graphical information just obtained concerning the global minimum
points of $F_{\beta,K}$ for $\beta > \beta_c$ motivates the form of $\tilde{\mathcal{E}}_{\beta,K}$ stated in the next
theorem. The positive quantity $z(\beta, K)$ equals $w(\beta, K)/(2\beta K)$, where $w(\beta, K)$ is
the unique positive global minimum point of $F_{\beta,K}$ for $K > K_c^{(1)}(\beta)$ [Lemma 3.10].
According to part (d) of the theorem, $z(\beta, K)$ is a continuous function for
$K > K_c^{(1)}(\beta)$, and as $K \to (K_c^{(1)}(\beta))^+$, $z(\beta, K)$ converges to the positive
quantity $z(\beta, K_c^{(1)}(\beta))$. Hence, the bifurcation exhibited by $\tilde{\mathcal{E}}_{\beta,K}$ at $K_c^{(1)}(\beta)$
is discontinuous. As we will see in the proof of Theorem 3.8, $K_c^{(1)}(\beta)$ is the
unique zero of the function $D(K)$ defined in (3.31) for $K > K_1(\beta)$; $K_1(\beta)$ is
specified in Lemma 3.9.



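Theorem 3.8. Define $\tilde{\mathcal{E}}_{\beta,K}$ by (3.7); equivalently,

$\tilde{\mathcal{E}}_{\beta,K} = \left\{ \frac{w}{2\beta K} \in \mathbb{R} : w \text{ minimizes } F_{\beta,K}(w) \right\}.$

For all $\beta > \beta_c = \log 4$, there exists a critical value $K_c^{(1)}(\beta)$ satisfying $K_1 <
K_c^{(1)}(\beta) < K_2$ and having the following properties:

(a) For $0 < K < K_c^{(1)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$.

(b) For $K = K_c^{(1)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{0, \pm z(\beta, K)\}$, where $z(\beta, K) > 0$.

(c) For $K > K_c^{(1)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{\pm z(\beta, K)\}$, where $z(\beta, K) > 0$.

(d) For $K > K_c^{(1)}(\beta)$, $z(\beta, K)$ is a strictly increasing continuous function,
and as $K \to (K_c^{(1)}(\beta))^+$, $z(\beta, K) \to z(\beta, K_c^{(1)}(\beta)) > 0$. Therefore, $\tilde{\mathcal{E}}_{\beta,K}$
exhibits a discontinuous bifurcation at $K_c^{(1)}(\beta)$.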

The proof of the theorem depends on several lemmas. In the first lemma
we prove that, for each $\beta > \beta_c$, there exists a unique $K = K_1(\beta)$ such that
the line $w/(2\beta K)$ is tangent to the curve $c'_\beta$ at a point $w_1(\beta) > 0$.

Lemma 3.9. For $\beta > \beta_c = \log 4$, we define $c_\beta$ by (3.4), $F_{\beta,K}$ by (3.12),
$w_c(\beta)$ by (3.15) and $K_2 = K_2(\beta)$ by (3.24). Then in the set $w > 0$, $K > 0$,
there exists a unique solution $(w_1, K_1) = (w_1(\beta), K_1(\beta))$ of

(3.25)  $F'_{\beta,K}(w) = \frac{w}{2\beta K} - c'_\beta(w) = 0,$

(3.26)  $F''_{\beta,K}(w) = \frac{1}{2\beta K} - c''_\beta(w) = 0.$

Furthermore, $w_1 > w_c(\beta)$ and $K_1 < K_2$ for all $\beta > \beta_c$.

Proof. The function $g(w) = w c''_\beta(w) - c'_\beta(w)$ has the properties that
$g'(w) = w c'''_\beta(w)$ and that solutions of (3.25)-(3.26) solve $g(w) = 0$.
According to part (b) of Theorem 3.5, $c'_\beta(w)$ is strictly convex for $0 < w < w_c(\beta)$
and $c'_\beta(w)$ is strictly concave for $w > w_c(\beta)$; equivalently, $c'''_\beta(w) > 0$ for
$0 < w < w_c(\beta)$ and $c'''_\beta(w) < 0$ for $w > w_c(\beta)$. Therefore,

(3.27)  $g'(w) > 0$ for $0 < w < w_c(\beta)$ and $g'(w) < 0$ for $w > w_c(\beta)$.

Since

$w c''_\beta(w) = \frac{2w e^{-\beta} \cosh w + 4w e^{-2\beta}}{[1 + 2e^{-\beta} \cosh w]^2},$

we see that $\lim_{w \to \infty} w c''_\beta(w) = 0$. It follows that

$\lim_{w \to \infty} g(w) = \lim_{w \to \infty} w c''_\beta(w) - \lim_{w \to \infty} c'_\beta(w) = -1.$

This limit and the fact that $g(0) = 0$, combined with the continuity of $g$
and (3.27), imply that there exists a unique $w_1 > w_c(\beta)$ such that $g(w_1) = 0$;
that is,

(3.28)  $\frac{c'_\beta(w_1)}{w_1} = c''_\beta(w_1).$

Substituting $w_1$ into (3.25) and (3.26), we define

(3.29)  $K_1 = \frac{1}{2\beta c''_\beta(w_1)} = \frac{w_1}{2\beta c'_\beta(w_1)}.$

The pair $(w_1, K_1)$ is a solution of (3.25)-(3.26) in the set $w > 0$, $K > 0$. If
$(w, K)$ is another solution of (3.25)-(3.26) in this set, then $w$ solves $g(w) = 0$,
a contradiction to the fact that $w_1$ is the unique positive solution of $g(w) = 0$.
It follows that $(w_1, K_1)$ is the unique solution of (3.25)-(3.26) in the set
$w > 0$, $K > 0$.

We complete the proof by showing that $K_1 < K_2$. Since $K_2 = 1/(2\beta c''_\beta(0))$,
we are done if we show that $c''_\beta(w_1) > c''_\beta(0)$. By the mean value theorem
and (3.28), there exists $\alpha \in (0, w_1)$ such that

(3.30)  $c''_\beta(\alpha) = \frac{c'_\beta(w_1)}{w_1} = c''_\beta(w_1).$

We claim that $\alpha < w_c(\beta)$. If $\alpha \ge w_c(\beta)$, then since $c'_\beta$ is strictly concave
on $(w_c(\beta), \infty)$, the inequalities $w_c(\beta) \le \alpha < w_1$ imply that $c''_\beta(\alpha) > c''_\beta(w_1)$.
Because this contradicts (3.30), we conclude that $\alpha < w_c(\beta)$. This inequality
in combination with the strict convexity of $c'_\beta$ on $(0, w_c(\beta))$ and (3.30) yields
$c''_\beta(0) < c''_\beta(\alpha) = c''_\beta(w_1)$. The proof of the lemma is complete. □
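The proof locates $w_1$ as the unique zero of $g$ on $(w_c(\beta), \infty)$, which suggests a direct numerical procedure. The following sketch, offered only as an illustration, finds that zero by bisection for $\beta = 4$ and then evaluates $K_1$ from (3.29) and $K_2$ from (3.24). The explicit formulas for $c'_\beta$ and $c''_\beta$ in the comments are assumptions consistent with (3.16), and the function names and bracketing strategy are hypothetical choices.

```python
import math

def c1(beta, w):
    # ASSUMPTION: c'_beta(w) = 2 e^{-beta} sinh w / (1 + 2 e^{-beta} cosh w).
    a = math.exp(-beta)
    return 2.0 * a * math.sinh(w) / (1.0 + 2.0 * a * math.cosh(w))

def c2(beta, w):
    # ASSUMPTION: c''_beta(w), consistent with the formula for w c''_beta(w) above.
    a = math.exp(-beta)
    return (2.0 * a * math.cosh(w) + 4.0 * a * a) / (1.0 + 2.0 * a * math.cosh(w)) ** 2

def w_c(beta):
    # Inflection point (3.15), for beta > log 4.
    return math.acosh(0.5 * math.exp(beta) - 4.0 * math.exp(-beta))

def tangency(beta, iters=200):
    """Return (w1, K1): zero of g(w) = w c''(w) - c'(w) on (w_c, inf), and K1 from (3.29)."""
    g = lambda w: w * c2(beta, w) - c1(beta, w)
    lo = w_c(beta)           # g > 0 here, since g(0) = 0 and g increases on (0, w_c)
    hi = lo + 1.0
    while g(hi) > 0:         # terminates because g(w) -> -1 as w -> infinity
        hi += 1.0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0:
            lo = mid
        else:
            hi = mid
    w1 = 0.5 * (lo + hi)
    return w1, 1.0 / (2.0 * beta * c2(beta, w1))

beta = 4.0
w1, K1 = tangency(beta)
K2 = 1.0 / (2.0 * beta * c2(beta, 0.0))
print(w1, w_c(beta), K1, K2)
```

The output should satisfy $w_1 > w_c(\beta)$ and $K_1 < K_2$, as the lemma asserts, and the line $w/(2\beta K_1)$ should touch $c'_\beta$ at $w_1$.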

We next state two lemmas that are analogous to Lemma 3.7 and part (c) of
Theorem 3.6. Before stating them, we need some preliminaries. In Lemma 3.9,
we proved that, for $\beta > \beta_c$, equations (3.25)-(3.26) have a unique solution
$(w_1, K_1) = (w_1(\beta), K_1(\beta))$ in the set $w > 0$, $K > 0$ and that $w_1 > w_c(\beta)$;
according to (3.25),

$\frac{1}{2\beta K_1} = \frac{c'_\beta(w_1)}{w_1},$

which by (3.28) equals $c''_\beta(w_1)$.
In addition, for $0 < \beta \le \beta_c$, the quantity $K_c^{(2)}(\beta) = 1/(2\beta c''_\beta(0))$ introduced
in (3.19) has the property that

$\frac{1}{2\beta K_c^{(2)}(\beta)} = c''_\beta(0).$

For $\beta > \beta_c$, $c'_\beta(w)$ is strictly concave for $w > w_c(\beta)$ [Theorem 3.5(b)]. Thus
for $w \ge w_1$, the graph of $F'_{\beta,K_1}(w)$ over the interval $[w_1, \infty)$ for $\beta > \beta_c$
(Figure 2) is similar to that of $F'_{\beta,K_c^{(2)}(\beta)}(w)$ over the interval $[0, \infty)$ for $0 < \beta \le \beta_c$
[Figure 1(b)]. Specifically, for $\beta > \beta_c$ and $w \in [w_1, \infty)$, the graph of

$F'_{\beta,K_1}(w) = \frac{w}{2\beta K_1} - c'_\beta(w)$

is determined by the difference between the strictly concave function $c'_\beta(w)$
and the linear function $w/(2\beta K_1)$, which is tangent to $c'_\beta$ at $w = w_1$.
Similarly, for $0 < \beta \le \beta_c$ and $w \in [0, \infty)$, the graph of

$F'_{\beta,K_c^{(2)}(\beta)}(w) = \frac{w}{2\beta K_c^{(2)}(\beta)} - c'_\beta(w)$

is determined by the difference between the strictly concave function $c'_\beta(w)$
[Theorem 3.5(a)] and the linear function $w/(2\beta K_c^{(2)}(\beta))$, which is tangent
to $c'_\beta$ at $w = 0$.

As we saw in Section 3.2 for $0 < \beta \le \beta_c$, as $K$ increases from $K_c^{(2)}(\beta)$
and thus the slope of the line $w/(2\beta K)$ decreases, $F_{\beta,K}$ develops a unique
positive local minimum point $w(\beta, K)$, which is shown to be the unique
global minimum point on the interval $[0, \infty)$ [Lemma 3.7(b)]. This can be
seen graphically in Figure 1(c). Similarly, as Figures 4(b)-7(b) illustrate,
for $\beta > \beta_c$, as $K$ increases from $K_1$, $F_{\beta,K}$ develops a unique positive
local minimum point $w(\beta, K)$. As in part (b) of Lemma 3.7, $w(\beta, K)$ can be
shown to be the unique global minimum point on the interval $[w_1, \infty)$.
However, it is not a global minimum point on the entire halfline $[0, \infty)$ unless
$F_{\beta,K}(w(\beta, K)) \le 0 = F_{\beta,K}(0)$; in fact, this inequality is valid only for all
$K$ sufficiently large. When $F_{\beta,K}(w(\beta, K)) > 0 = F_{\beta,K}(0)$, which holds for
all $K > K_1$ sufficiently close to $K_1$, 0 is the unique global minimum point
of $F_{\beta,K}$.

Because the behavior of the function $F_{\beta,K}$ over the interval $[w_1, \infty)$ for
$\beta > \beta_c$ is similar to that of $F_{\beta,K}$ over the interval $[0, \infty)$ for $0 < \beta \le \beta_c$, the
proofs of Lemma 3.10 and Lemma 3.11 are analogous, respectively, to the
proofs of Lemma 3.7 and part (c) of Theorem 3.6. Therefore, we state these
new lemmas without proof.

In Lemma 3.10 we state the existence and two properties of a positive
critical point $w(\beta, K)$ of $F_{\beta,K}$ for each $K > K_1$.

Lemma 3.10. For $\beta > \beta_c = \log 4$, define $F_{\beta,K}$ by (3.12) and let $(w_1, K_1) =
(w_1(\beta), K_1(\beta))$ be the unique solution of (3.25)-(3.26) in the set $w > 0$, $K > 0$
(Lemma 3.9). The following conclusions hold:

(a) For each $K > K_1$, $F_{\beta,K}$ has a critical point $w(\beta, K) > w_1$ satisfying

$F'_{\beta,K}(w(\beta, K)) = 0$ and $F''_{\beta,K}(w(\beta, K)) > 0$.

(b) For each $K > K_1$, $F_{\beta,K}$ has unique nonzero local minimum points at
$w = \pm w(\beta, K)$.

(c) The points $\{w(\beta, K), K > K_1\}$ span the interval $(w_1, \infty)$; that is, for
each $x > w_1$, there exists $K > K_1$ such that $x = w(\beta, K)$.

The next lemma states continuity and related properties of $w(\beta, K)$ and
$z(\beta, K)$ that are similar to properties of the analogous quantities for
$0 < \beta \le \beta_c$ [Theorem 3.6(c)].

Lemma 3.11. For $\beta > \beta_c = \log 4$ and $K > K_1$, let $w(\beta, K)$ be the unique
positive local minimum point of $F_{\beta,K}$ considered in Lemma 3.10. Then for
$K > K_1$, $w(\beta, K)$ and $z(\beta, K) = w(\beta, K)/(2\beta K)$ are continuous, strictly
increasing functions of $K$ and $\lim_{K \to K_1^+} w(\beta, K) = w_1$.

We fix $\beta > \beta_c$. The proof of Theorem 3.8 also makes use of the function

(3.31)  $D(K) = F_{\beta,K_1}(w_1)$ if $K = K_1$, and $D(K) = F_{\beta,K}(w(\beta, K))$ if $K > K_1$.

The quantity $w(\beta, K)$ is the unique positive local minimum point of $F_{\beta,K}$,
the existence of which is given in Lemma 3.10.

Lemma 3.12. For $\beta > \beta_c = \log 4$, the function $D(K)$ defined in (3.31)
is continuous and strictly decreasing on its domain $[K_1(\beta), \infty)$.

Proof. Since $F_{\beta,K}(w)$ is a continuous function of $w$ and $w(\beta, K)$ is a
continuous function of $K$ (Lemma 3.11), $D(K)$ is continuous for $K > K_1$.
Furthermore, by Lemma 3.11, $\lim_{K \to K_1^+} w(\beta, K) = w_1$ and, thus,
$\lim_{K \to K_1^+} F_{\beta,K}(w(\beta, K)) = F_{\beta,K_1}(w_1)$. We conclude that $D(K)$ is continuous
on $[K_1, \infty)$.

We now prove that $D(K)$ is strictly decreasing on $[K_1, \infty)$. For $K > K_1$,
we have

$F'_{\beta,K}(w(\beta, K)) = 0$

by part (a) of Lemma 3.10. As in the proof of part (c) of Theorem 3.6, one
can show that $w(\beta, K)$ is continuously differentiable for $K > K_1$. Hence, for
$K > K_1$,

$D'(K) = \frac{\partial F_{\beta,K}(w(\beta, K))}{\partial K} + F'_{\beta,K}(w(\beta, K)) \frac{\partial w(\beta, K)}{\partial K} = -\frac{[w(\beta, K)]^2}{4\beta K^2} < 0.$

This completes the proof. □



Proof of Theorem 3.8. As we showed in Lemma 3.9, for $\beta > \beta_c$,
equations (3.25)-(3.26) have a unique solution $(w_1, K_1) = (w_1(\beta), K_1(\beta))$ in
the set $w > 0$, $K > 0$. In addition, $K_1 < K_2 = 1/(2\beta c''_\beta(0))$. We start the proof
of Theorem 3.8 by proving the following two facts:

1. For $0 < K \le K_1$, $F_{\beta,K}$ has a unique global minimum point at $w = 0$
[Figures 2 and 3(b)].

2. For $K \ge K_2$, $F_{\beta,K}$ has unique global minimum points at $w = \pm w(\beta, K)$
(Figure 7).

According to Lemma 3.10, $F'_{\beta,K_1}(w_1) = 0$. Using concavity properties of
$c'_\beta(w)$ established in part (b) of Theorem 3.5 and calculations similar to
those used to establish other results in this and the preceding section, one
shows that, for $0 < K < K_1$, $F'_{\beta,K}(w) > 0$ for all $w > 0$ and that $F'_{\beta,K_1}(w) > 0$
for all $w > 0$, $w \ne w_1$. These properties, which can be seen in Figure 2 and
Figure 3(a), are proved in detail in Lemma 2.3.10 in [21]. By symmetry, for
$0 < K < K_1$, $F'_{\beta,K}(w) < 0$ for all $w < 0$ and $F'_{\beta,K_1}(w) < 0$ for all $w < 0$, $w \ne -w_1$.
It follows that, for $0 < K \le K_1$, $F_{\beta,K}$ is strictly decreasing for $w < 0$
and strictly increasing for $w > 0$. We conclude that, for $0 < K \le K_1$, $F_{\beta,K}$
has a unique global minimum point at $w = 0$, as claimed in fact 1.

Since $\lim_{|w| \to \infty} F_{\beta,K}(w) = \infty$, the global minimum values of $F_{\beta,K}$ must
be attained at local minimum points of the function. Lemma 3.10 states
that, for $K > K_1$, $w = \pm w(\beta, K)$ are the unique nonzero local minimum
points of $F_{\beta,K}$. Therefore, we prove that, for $K \ge K_2$, $F_{\beta,K}$ has unique global
minimum points at $w = \pm w(\beta, K)$ by proving that $w = 0$ is a local maximum
point of $F_{\beta,K}$. According to part (b) of Theorem 3.5, $c'_\beta(w)$ is strictly convex
for $0 < w < w_c(\beta)$. Therefore, for $K \ge K_2$ and $w \in (0, w_c(\beta))$,

$F'_{\beta,K}(w) = \frac{w}{2\beta K} - c'_\beta(w) < \frac{w}{2\beta K} - w\,c''_\beta(0) = \frac{w}{2\beta K} - \frac{w}{2\beta K_2} \le 0;$

that is, for $w \in (0, w_c(\beta))$, $F'_{\beta,K}(w) < 0$. By symmetry, for $w \in (-w_c(\beta), 0)$,
$F'_{\beta,K}(w) > 0$. It follows that $w = 0$ is a local maximum point of $F_{\beta,K}$.
Therefore, as claimed in fact 2, for $K \ge K_2$, $F_{\beta,K}$ has unique global minimum
points at $w = \pm w(\beta, K)$.

For $K_1 < K < K_2$, $F_{\beta,K}$ has exactly three local minimum points at $w = 0$
and $w = \pm w(\beta, K)$. Since global minimum values of $F_{\beta,K}$ must be attained at
local minimum points of the function and $F_{\beta,K}$ is symmetric, finding global
minimum points of $F_{\beta,K}$ requires comparing the values of the function at
$w = 0$ and at $w = w(\beta, K)$.

Since for $0 < K \le K_1$ $F_{\beta,K}$ has a unique global minimum point at $w = 0$,
we have

$D(K_1) = F_{\beta,K_1}(w_1) > F_{\beta,K_1}(0) = 0.$

Similarly, since for $K \ge K_2$ $F_{\beta,K}$ has unique global minimum points at
$w = \pm w(\beta, K)$, for $K \ge K_2$, we have

$D(K) = F_{\beta,K}(w(\beta, K)) < F_{\beta,K}(0) = 0.$

Since $D(K)$ is continuous and strictly decreasing for $K \ge K_1$ (Lemma 3.12),
there exists a unique critical value $K_c^{(1)}(\beta)$ satisfying $K_1 < K_c^{(1)}(\beta) < K_2$
and having the following properties:

(i) For $K_1 < K < K_c^{(1)}(\beta)$,

$F_{\beta,K}(w(\beta, K)) = D(K) > 0 = F_{\beta,K}(0),$

and, thus, $F_{\beta,K}$ has a unique global minimum point at $w = 0$.

(ii) For $K = K_c^{(1)}(\beta)$,

$F_{\beta,K}(w(\beta, K)) = D(K) = 0 = F_{\beta,K}(0),$

and, thus, $F_{\beta,K}$ has three global minimum points at $w = 0, \pm w(\beta, K)$.

(iii) For $K_c^{(1)}(\beta) < K < K_2$,

$F_{\beta,K}(w(\beta, K)) = D(K) < 0 = F_{\beta,K}(0),$

and, thus, $F_{\beta,K}$ has two global minimum points at $w = \pm w(\beta, K)$.

We define $z(\beta, K) = w(\beta, K)/(2\beta K)$. The form of $\tilde{\mathcal{E}}_{\beta,K}$ given in parts
(a)-(c) of Theorem 3.8 follows from the information on the global minimum
points of $F_{\beta,K}$ just given in items (i)-(iii) and from facts 1 and 2 stated at the
start of the proof. In addition, the positivity of $z(\beta, K_c^{(1)}(\beta))$ is a consequence
of the positivity of $w(\beta, K_c^{(1)}(\beta))$. Since by Lemma 3.11 $z(\beta, K)$ is a strictly
increasing function for $K > K_c^{(1)}(\beta)$, part (d) of the theorem is also proved.
The proof of Theorem 3.8 is now complete. □
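The proof characterizes $K_c^{(1)}(\beta)$ as the unique zero of the decreasing function $D(K)$ on $(K_1, K_2)$, and this can be turned into a small numerical experiment. The sketch below is an illustration under stated assumptions, not the authors' computation: it assumes the explicit form $c_\beta(t) = \log[(1 + 2e^{-\beta}\cosh t)/(1 + 2e^{-\beta})]$ for (3.4) and its derivatives, uses plain bisection throughout, and all function names are hypothetical.

```python
import math

def c0(beta, t):
    # ASSUMPTION: explicit form of c_beta from (3.4), not restated in this section.
    a = math.exp(-beta)
    return math.log((1.0 + 2.0 * a * math.cosh(t)) / (1.0 + 2.0 * a))

def c1(beta, w):
    a = math.exp(-beta)
    return 2.0 * a * math.sinh(w) / (1.0 + 2.0 * a * math.cosh(w))

def c2(beta, w):
    a = math.exp(-beta)
    return (2.0 * a * math.cosh(w) + 4.0 * a * a) / (1.0 + 2.0 * a * math.cosh(w)) ** 2

def F(beta, K, w):
    return w * w / (4.0 * beta * K) - c0(beta, w)

def bisect(f, lo, hi, iters=200):
    """Root of f on [lo, hi], assuming f < 0 near lo and f > 0 near hi."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def tangency(beta):
    """(w1, K1) of Lemma 3.9: zero of w c''(w) - c'(w) beyond w_c(beta)."""
    a = math.exp(-beta)
    wc = math.acosh(0.5 / a - 4.0 * a)
    minus_g = lambda w: c1(beta, w) - w * c2(beta, w)  # negative at w_c, tends to +1
    hi = wc + 1.0
    while minus_g(hi) < 0:
        hi += 1.0
    w1 = bisect(minus_g, wc, hi)
    return w1, 1.0 / (2.0 * beta * c2(beta, w1))

def w_of_K(beta, K, w1):
    """Positive local minimum point of F_{beta,K} on (w1, inf); requires K > K1."""
    return bisect(lambda w: w / (2.0 * beta * K) - c1(beta, w), w1, 4.0 * beta * K + w1 + 1.0)

def D(beta, K, w1):
    return F(beta, K, w_of_K(beta, K, w1))

beta = 4.0
w1, K1 = tangency(beta)
K2 = 1.0 / (2.0 * beta * c2(beta, 0.0))
# D is positive near K1 and negative at K2, so -D changes sign from - to +.
Kc1 = bisect(lambda K: -D(beta, K, w1), K1 + 1e-9, K2)
z_jump = w_of_K(beta, Kc1, w1) / (2.0 * beta * Kc1)
print(K1, Kc1, K2, z_jump)
```

The printed values should satisfy $K_1 < K_c^{(1)}(\beta) < K_2$, and `z_jump` is the positive value $z(\beta, K_c^{(1)}(\beta))$ to which the macrostate jumps from 0, which is what makes the bifurcation discontinuous.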

Together, Theorems 3.6 and 3.8 give a full description of the set $\tilde{\mathcal{E}}_{\beta,K}$ for
all values of $\beta$ and $K$. In the next section, we use the contraction principle to
lift our results concerning the structure of the set $\tilde{\mathcal{E}}_{\beta,K}$ up to the level of the
empirical measures, making use of a one-to-one correspondence between the
points in the two sets $\mathcal{E}_{\beta,K}$ and $\tilde{\mathcal{E}}_{\beta,K}$ of canonical equilibrium macrostates.
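3.4. One-to-one correspondence between $\mathcal{E}_{\beta,K}$ and $\tilde{\mathcal{E}}_{\beta,K}$. We start by
recalling the definitions of the sets $\mathcal{E}_{\beta,K}$ and $\tilde{\mathcal{E}}_{\beta,K}$:

(3.32)  $\mathcal{E}_{\beta,K} = \{\nu \in \mathcal{P}(\Lambda) : \nu \text{ minimizes } R(\nu|\rho) + \beta f_K(\nu)\}$

and

(3.33)  $\tilde{\mathcal{E}}_{\beta,K} = \{z \in [-1, 1] : z \text{ minimizes } J_\beta(z) - \beta K z^2\}.$

In the definition of $\mathcal{E}_{\beta,K}$, $R(\nu|\rho)$ is the relative entropy of $\nu$ with respect
to $\rho = \frac{1}{3}(\delta_{-1} + \delta_0 + \delta_1)$ and $f_K(\nu)$ is the function defined in (2.3). In the
definition of $\tilde{\mathcal{E}}_{\beta,K}$, $J_\beta$ is the Cramér rate function defined in (3.3). We now
state the one-to-one correspondence between the points in $\mathcal{E}_{\beta,K}$ and the
points in $\tilde{\mathcal{E}}_{\beta,K}$. According to Theorems 3.6 and 3.8, $\tilde{\mathcal{E}}_{\beta,K}$ consists of either
1, 2 or 3 points.

Theorem 3.13. Fix $\beta > 0$ and $K > 0$ and suppose that $\tilde{\mathcal{E}}_{\beta,K} = \{z_\alpha\}_{\alpha=1}^r$,
$r = 1, 2$ or $3$. Define $\nu_\alpha$, $\alpha = 1, \ldots, r$, to be measures in $\mathcal{P}(\Lambda)$ with densities

(3.34)  $\frac{d\nu_\alpha}{d\rho_\beta}(y) = \exp(t_\alpha y) \cdot \frac{1}{\int_\Lambda \exp(t_\alpha y)\,\rho_\beta(dy)},$

where $t_\alpha$ is chosen such that $\int_\Lambda y\,\nu_\alpha(dy) = z_\alpha$. Then for each $\alpha = 1, \ldots, r$,
$t_\alpha$ exists and is unique, and $\mathcal{E}_{\beta,K}$ consists of the unique elements $\nu_\alpha$, $\alpha =
1, \ldots, r$. Furthermore, $t_\alpha = 2\beta K z_\alpha$ for $\alpha = 1, \ldots, r$.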

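For $z \in [-1, 1]$, we define

(3.35)  $A(z) = \left\{ \mu \in \mathcal{P}(\Lambda) : \int_\Lambda y\,\mu(dy) = z \right\}.$

The proof of the theorem depends on the following two lemmas. Both lemmas
use the contraction principle ([9], Theorem VIII.3.1), which states that, for
all $z \in [-1, 1]$,

(3.36)  $J_\beta(z) = \min\{R(\mu|\rho_\beta) : \mu \in A(z)\}.$

Lemma 3.14. For $\beta > 0$ and $K > 0$,

$\min_{\mu \in \mathcal{P}(\Lambda)} \left\{ R(\mu|\rho_\beta) - \beta K \left( \int_\Lambda y\,\mu(dy) \right)^2 \right\} = \min_{|z| \le 1} \{J_\beta(z) - \beta K z^2\}.$

Proof. The contraction principle (3.36) implies that

$\min_{\mu \in \mathcal{P}(\Lambda)} \left\{ R(\mu|\rho_\beta) - \beta K \left( \int_\Lambda y\,\mu(dy) \right)^2 \right\}$
$= \min_{|z| \le 1} \min \left\{ R(\mu|\rho_\beta) - \beta K \left( \int_\Lambda y\,\mu(dy) \right)^2 : \mu \in A(z) \right\}$
$= \min_{|z| \le 1} \left( \min\{R(\mu|\rho_\beta) : \mu \in A(z)\} - \beta K z^2 \right)$
$= \min_{|z| \le 1} \{J_\beta(z) - \beta K z^2\}.$

This completes the proof. □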

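The second lemma shows that the mean of any measure $\nu \in \mathcal{E}_{\beta,K}$ is an
element of $\tilde{\mathcal{E}}_{\beta,K}$.

Lemma 3.15. Fix $\beta > 0$ and $K > 0$. Given $\nu \in \mathcal{E}_{\beta,K}$, we define $\tilde{z} =
\int_\Lambda y\,\nu(dy)$, where $\Lambda = \{-1, 0, 1\}$. Then $\tilde{z} \in \tilde{\mathcal{E}}_{\beta,K}$.

Proof. Since $\nu \in \mathcal{E}_{\beta,K}$, $\nu$ is a global minimum point of $R(\mu|\rho_\beta) -
\beta K (\int_\Lambda y\,\mu(dy))^2$. Thus, for all $\mu \in \mathcal{P}(\Lambda)$,

$R(\nu|\rho_\beta) - \beta K \left( \int_\Lambda y\,\nu(dy) \right)^2 = R(\nu|\rho_\beta) - \beta K \tilde{z}^2 \le R(\mu|\rho_\beta) - \beta K \left( \int_\Lambda y\,\mu(dy) \right)^2.$

In particular, this inequality holds for any $\mu$ that satisfies $\int_\Lambda y\,\mu(dy) = \tilde{z}$. For
such $\mu$, the last display becomes

$R(\nu|\rho_\beta) \le R(\mu|\rho_\beta).$

Thus, $\nu$ satisfies

$R(\nu|\rho_\beta) = \min\{R(\mu|\rho_\beta) : \mu \in A(\tilde{z})\},$

where $A(\tilde{z})$ is defined in (3.35). The contraction principle (3.36) and Lemma 3.14
imply that

$J_\beta(\tilde{z}) - \beta K \tilde{z}^2 = R(\nu|\rho_\beta) - \beta K \left( \int_\Lambda y\,\nu(dy) \right)^2$
$= \min_{\mu \in \mathcal{P}(\Lambda)} \left\{ R(\mu|\rho_\beta) - \beta K \left( \int_\Lambda y\,\mu(dy) \right)^2 \right\}$
$= \min_{|z| \le 1} \{J_\beta(z) - \beta K z^2\}.$

Therefore, $\tilde{z} \in \tilde{\mathcal{E}}_{\beta,K}$, as claimed. This completes the proof. □

We next prove Theorem 3.13.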

Proof of Theorem 3.13. A short calculation shows that, for any 
R{li\p) + l3fK{p) - ini {R{u\p)+(3fKW)} 

= R{p\pp) - OK (^^ yp{dy)^ ' - ^uii^l^R{v\pp) - (3K (^^ yi^(dy)) '| . 

Hence, we obtain the following alternate characterization of £/3j^- 

(3.37) £i3,K = ^V{A):u mmhmzes R{u\pf3) - l3K(^J^yiy{dy)j |. 

We first show for each a = 1, . . . ,r and Za € £i3,K, is the unique global 
minimum point of R{ii\pp) — f3K{j^yp,{dy)Y over 



A{z^) = {p(^ P(A) : / yp{dy) = z^ 
We then prove that 



inf \r{p\pp)-Pk( [ yp{dy))' 
■Ai^c) I \Ja J 

inf \r{p\pp)-Pk( f yp{dy)\ 



fj.eA{ze) 

for all Q, £ = 1, . . . , r. It will then follow that {i^a}a=i equals the set of global 
minimum points of R{p\pi3) — PK{J^yp{dy))'^ over the set A = Ua=i A{za)- 
Finally, by showing that all the global minimum points of R{p,\pp) — j3K{^^ yp{dy)y 
lie in A, we will complete the proof that Sjs^K = {''^aYa=i- If = 2 or 3, then 
since Jj^yi^a{dy) = Za, it is clear that if Za7^ zi, then v^i^vi- 

By Theorem VIII. 3.1 in [9], for each a = l,...,r, the point in the 
statement of Theorem 3.13 exists and is unique, 

(3.38) Jpizo) = RiualPp), 

and R{p,\pp) attains its infimum over A{za) at the unique measure Va- 
Therefore, for each a = l,...,r, is the unique global minimum point 
of R{p\pp) - l3K{Sf^ypi{dy)f over A(z«). 
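We next show that

\[
\inf_{\mu\in A(\tilde z_\alpha)}\Bigl\{R(\mu|\rho_\beta) - \beta K\Bigl(\int_\Lambda y\,\mu(dy)\Bigr)^2\Bigr\}
 = \inf_{\mu\in A(\tilde z_\ell)}\Bigl\{R(\mu|\rho_\beta) - \beta K\Bigl(\int_\Lambda y\,\mu(dy)\Bigr)^2\Bigr\}
\]

for all $\alpha, \ell = 1,\dots,r$. Since $\tilde z_\alpha, \tilde z_\ell \in \tilde{\mathcal{E}}_{\beta,K}$, $\tilde z_\alpha$ and $\tilde z_\ell$ are global minimum points of $J_\beta(z) - \beta K z^2$. Thus, by (3.38), we have

\begin{align*}
\inf_{\mu\in A(\tilde z_\alpha)}\Bigl\{R(\mu|\rho_\beta) - \beta K\Bigl(\int_\Lambda y\,\mu(dy)\Bigr)^2\Bigr\}
 &= \inf_{\mu\in A(\tilde z_\alpha)} R(\mu|\rho_\beta) - \beta K \tilde z_\alpha^2 \\
 &= J_\beta(\tilde z_\alpha) - \beta K \tilde z_\alpha^2 \\
 &= J_\beta(\tilde z_\ell) - \beta K \tilde z_\ell^2 \\
 &= \inf_{\mu\in A(\tilde z_\ell)} R(\mu|\rho_\beta) - \beta K \tilde z_\ell^2 \\
 &= \inf_{\mu\in A(\tilde z_\ell)}\Bigl\{R(\mu|\rho_\beta) - \beta K\Bigl(\int_\Lambda y\,\mu(dy)\Bigr)^2\Bigr\}.
\end{align*}

As a result, $\{\nu_\alpha\}_{\alpha=1}^r$ equals the set of global minimum points of $R(\mu|\rho_\beta) - \beta K(\int_\Lambda y\,\mu(dy))^2$ over the set $A = \bigcup_{\alpha=1}^r A(\tilde z_\alpha)$.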

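Last, we show that $R(\mu|\rho_\beta) - \beta K(\int_\Lambda y\,\mu(dy))^2$ attains its global minimum at points in $A$. Let $\sigma$ be a global minimum point of $R(\mu|\rho_\beta) - \beta K(\int_\Lambda y\,\mu(dy))^2$. By (3.37), this implies that $\sigma \in \mathcal{E}_{\beta,K}$. Define $\zeta = \int_\Lambda y\,\sigma(dy)$. Then Lemma 3.15 implies that $\zeta \in \tilde{\mathcal{E}}_{\beta,K}$ and, thus, that $\zeta = \tilde z_\alpha$ for some $\alpha = 1,\dots,r$. It follows that $\sigma \in A(\tilde z_\alpha) \subset A$ for some $\alpha = 1,\dots,r$.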

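The last step is to prove that $t_\alpha = 2\beta K \tilde z_\alpha$ for $\alpha = 1,\dots,r$. From definition (3.4), we have

\[
c_\beta'(t_\alpha) = \int_\Lambda y\,\nu_\alpha(dy) = \tilde z_\alpha.
\]

In turn, the inverse relationship (3.5) implies that

\[
t_\alpha = (c_\beta')^{-1}(\tilde z_\alpha) = J_\beta'(\tilde z_\alpha).
\]

Therefore, since $\tilde z_\alpha \in \tilde{\mathcal{E}}_{\beta,K}$, the definition (3.33) guarantees that $\tilde z_\alpha$ is a critical point of $J_\beta(z) - \beta K z^2$. Thus,

\[
t_\alpha = J_\beta'(\tilde z_\alpha) = 2\beta K \tilde z_\alpha. \tag{3.39}
\]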

This completes the proof of Theorem 3.13. □ 

In the next section we use Theorem 3.13 to prove Theorems 3.1 and 3.2. 

3.5. Proofs of Theorems 3.1 and 3.2. Theorem 3.1 gives the structure of the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates, pointing out the continuous bifurcation exhibited by that set for $0 < \beta \le \beta_c = \log 4$. The structure of $\mathcal{E}_{\beta,K}$ for $\beta > \beta_c$, given in Theorem 3.2, features a discontinuous bifurcation in $K$. The proofs of these theorems are immediate from Theorems 3.6 and 3.8, respectively, which give the structure of $\tilde{\mathcal{E}}_{\beta,K}$ for $0 < \beta \le \beta_c$ and for $\beta > \beta_c$, and from Theorem 3.13, which states a one-to-one correspondence between $\tilde{\mathcal{E}}_{\beta,K}$ and $\mathcal{E}_{\beta,K}$.

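Before proving Theorems 3.1 and 3.2, it is useful to express the measures $\rho_\beta$ and $\nu_\alpha$ in Theorem 3.13 in the forms $\rho_\beta = \rho_{\beta,-1}\delta_{-1} + \rho_{\beta,0}\delta_0 + \rho_{\beta,1}\delta_1$ and $\nu_\alpha = \nu_{\alpha,-1}\delta_{-1} + \nu_{\alpha,0}\delta_0 + \nu_{\alpha,1}\delta_1$, respectively. Since $t_\alpha = 2\beta K \tilde z_\alpha$, in terms of $\tilde z_\alpha \in \tilde{\mathcal{E}}_{\beta,K}$ we have

\[
\rho_{\beta,-1} = \rho_{\beta,1} = \frac{e^{-\beta}}{1 + 2e^{-\beta}}, \qquad \rho_{\beta,0} = \frac{1}{1 + 2e^{-\beta}}
\]

and

\[
\nu_{\alpha,-1} = \frac{e^{-2\beta K \tilde z_\alpha - \beta}}{C(\beta,K)}, \qquad
\nu_{\alpha,0} = \frac{1}{C(\beta,K)}, \qquad
\nu_{\alpha,1} = \frac{e^{2\beta K \tilde z_\alpha - \beta}}{C(\beta,K)}.
\]

Here

\[
C(\beta,K) = e^{-2\beta K \tilde z_\alpha - \beta} + e^{2\beta K \tilde z_\alpha - \beta} + 1.
\]

In particular, $\nu_\alpha = \rho_\beta$ when $\tilde z_\alpha = 0$.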
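
The formulas for $\rho_\beta$ and $\nu_\alpha$ can be checked numerically. The following is a minimal sketch, not part of the original analysis: the function names, the sample values $\beta = 1$, $K = 1.5$ and the fixed-point iteration are illustrative assumptions. The iteration solves $z = c_\beta'(2\beta K z)$, which only locates a critical point; it is used here simply to produce a value of $z$ at which the mean of $\nu_\alpha$ can be compared with $z$.

```python
import math

def rho_beta(beta):
    """Weights of rho_beta on the points (-1, 0, 1)."""
    a = math.exp(-beta)
    norm = 1.0 + 2.0 * a
    return (a / norm, 1.0 / norm, a / norm)

def nu_alpha(beta, K, z):
    """Weights of nu_alpha on (-1, 0, 1), using t_alpha = 2*beta*K*z."""
    t = 2.0 * beta * K * z
    minus = math.exp(-t - beta)
    plus = math.exp(t - beta)
    C = minus + plus + 1.0  # the normalization C(beta, K)
    return (minus / C, 1.0 / C, plus / C)

def solve_spin(beta, K, z0=0.5, iterations=500):
    """Fixed-point iteration z <- mean of nu_alpha = c_beta'(2*beta*K*z).

    Hypothetical helper: it finds a solution of the critical-point
    equation, which is not automatically a global minimum point.
    """
    z = z0
    for _ in range(iterations):
        w = nu_alpha(beta, K, z)
        z = w[2] - w[0]
    return z

beta, K = 1.0, 1.5  # illustrative values only
z = solve_spin(beta, K)
nu = nu_alpha(beta, K, z)
print("z =", z, "nu =", nu)
```

At $z = 0$ the weights returned by `nu_alpha` reduce to those of `rho_beta`, in agreement with the last sentence above.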

We first indicate how Theorem 3.1 follows from Theorem 3.6. Fix $0 < \beta \le \beta_c$. The critical value $K_c^{(2)}(\beta)$ in Theorem 3.1 coincides with the value $K_c^{(2)}(\beta)$ in Theorem 3.6. For $0 < K \le K_c^{(2)}(\beta)$, part (a) of Theorem 3.6 indicates that $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$; hence, $\mathcal{E}_{\beta,K} = \{\rho_\beta\}$. For $K > K_c^{(2)}(\beta)$, part (b) of Theorem 3.6 indicates that $\tilde{\mathcal{E}}_{\beta,K} = \{\pm\tilde z(\beta,K)\}$, where $\tilde z(\beta,K) > 0$. It follows that the measures $\nu^+(\beta,K)$ and $\nu^-(\beta,K)$ in part (a)(ii) of Theorem 3.1 are given by (3.34) with $\tilde z_\alpha = \tilde z(\beta,K)$ and $\tilde z_\alpha = -\tilde z(\beta,K)$, respectively. Since $\tilde z(\beta,K) > 0$, it follows that $\nu^+(\beta,K) \ne \nu^-(\beta,K) \ne \rho_\beta$. Finally, part (c) of Theorem 3.6 allows us to conclude that, for each choice of sign, $\nu^\pm(\beta,K)$ is a continuous function for $K > K_c^{(2)}(\beta)$ and that as $K \to (K_c^{(2)}(\beta))^+$, $\nu^\pm(\beta,K) \to \rho_\beta$. This completes the proof of Theorem 3.1.

In a completely analogous way, Theorem 3.2, including the discontinuous bifurcation noted in part (c) of the theorem, follows from Theorem 3.8.

In this section we have completely analyzed the structure of the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates. In particular, we discovered that, for $0 < \beta \le \beta_c$, $\mathcal{E}_{\beta,K}$ undergoes a continuous bifurcation at $K = K_c^{(2)}(\beta)$ (Theorem 3.1) and that, for $\beta > \beta_c$, $\mathcal{E}_{\beta,K}$ undergoes a discontinuous bifurcation at $K = K_c^{(1)}(\beta)$ (Theorem 3.2). We depict these bifurcations in Figure 8. While the second-order critical values $K_c^{(2)}(\beta)$ are explicitly defined in Theorem 3.6, the first-order critical values $K_c^{(1)}(\beta)$ in the figure are computed numerically. The numerical procedure calculates $K_c^{(1)}(\beta)$ for fixed values of $\beta$ by determining the value of $K$ for which the number of global minimum points of $G_{\beta,K}(z)$ changes from one at $z = 0$ to three at $z = 0$ and $z = \pm\tilde z(\beta,K)$, where $\tilde z(\beta,K) > 0$. According to these numerical calculations for the discontinuous bifurcation, it appears that $K_c^{(1)}(\beta)$ tends to 1 as $\beta \to \infty$. However, we are unable to prove this limit.

Fig. 8. Bifurcation diagram for the BEG model with respect to the canonical ensemble.
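
The numerical procedure described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' actual code: it uses the explicit form $G_{\beta,K}(z) = \beta K z^2 - c_\beta(2\beta K z)$ with $c_\beta(t) = \log[(1 + 2e^{-\beta}\cosh t)/(1 + 2e^{-\beta})]$, so that $G_{\beta,K}(0) = 0$, and it locates the value of $K$ at which the minimum of $G_{\beta,K}$ over a grid in $(0,1]$ first reaches 0. The grid size, bisection bracket and tolerance are arbitrary choices.

```python
import math

def G(z, beta, K):
    """G_{beta,K}(z) = beta*K*z^2 - c_beta(2*beta*K*z); note G(0) = 0."""
    a = math.exp(-beta)
    w = 2.0 * beta * K * z
    c = math.log((1.0 + 2.0 * a * math.cosh(w)) / (1.0 + 2.0 * a))
    return beta * K * z * z - c

def min_away_from_zero(beta, K, n=2000):
    """Minimum of G over the grid z = 1/n, 2/n, ..., 1 (z = 0 excluded)."""
    return min(G(i / n, beta, K) for i in range(1, n + 1))

def first_order_critical_K(beta, tol=1e-6):
    """Bisection for the K at which a nonzero minimum reaches the level G(0) = 0.

    Intended for beta > log 4. The upper bracket is K_2 = 1/(2*beta*c_beta''(0)),
    which Theorem 3.8 places above the first-order critical value; the lower
    bracket 0.5 is an assumption made for this sketch.
    """
    a = math.exp(-beta)
    lo = 0.5
    hi = (1.0 + 2.0 * a) / (4.0 * beta * a)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if min_away_from_zero(beta, mid) < 0.0:
            hi = mid  # a nonzero point already beats z = 0
        else:
            lo = mid  # z = 0 is still the only global minimum point
    return 0.5 * (lo + hi)

kc1_beta2 = first_order_critical_K(2.0)
kc1_beta3 = first_order_critical_K(3.0)
print(kc1_beta2, kc1_beta3)
```

For the two sample values of $\beta$, the computed critical values lie between 1 and $K_c^{(2)}(\log 4)$ and decrease with $\beta$, which is the qualitative behavior described in the text.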

In Section 5 we will see that Figure 8 is a phase diagram that describes the phase transitions in the canonical ensemble as $\beta$ changes. We will also show that the nature of the bifurcations studied up to this point by varying $K$, while keeping $\beta$ fixed, is the same if we vary $\beta$ and keep $K$ fixed instead. The latter situation corresponds to what is referred to physically as a phase transition; specifically, the continuous bifurcation corresponds to a second-order phase transition and the discontinuous bifurcation to a first-order phase transition. In order to substantiate this claim concerning the bifurcations and the phase transitions, we have to transfer our analysis of $\mathcal{E}_{\beta,K}$ from fixed $\beta$ and varying $K$ to an analysis of $\mathcal{E}_{\beta,K}$ for fixed $K$ and varying $\beta$.

In the next section we study the BEG model with respect to the micro- 
canonical ensemble. 

4. Structure of the set of microcanonical equilibrium macrostates. In previous studies of the BEG model with respect to the microcanonical ensemble, results were obtained that either relied on a local analysis or used strictly numerical methods [2, 3, 15]. In this section we provide a global argument to support the existence of a continuous bifurcation exhibited by the set $\mathcal{E}^{u,K}$ of microcanonical equilibrium macrostates for fixed, sufficiently large values of $u$ and for varying $K$. Specifically, for fixed, sufficiently large $u$, $\mathcal{E}^{u,K}$ exhibits a continuous bifurcation as $K$ passes through a critical value $K_c^{(2)}(u)$. The argument is similar to the one employed to analyze the canonical ensemble in Section 3. However, unlike the canonical case, where a rigorous analysis of the structure of the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates was obtained for all values of $\beta$ and $K$, the analysis of $\mathcal{E}^{u,K}$ for sufficiently large $u$ and varying $K$ relies on a mixture of analysis and numerical methods. At the end of this section we summarize the numerical methods used to deduce the existence of a discontinuous bifurcation exhibited by $\mathcal{E}^{u,K}$ for fixed, sufficiently small $u$ and varying $K$. In Section 5 we show how to extrapolate this information to information concerning the phase transition behavior of the microcanonical ensemble for varying $u$: a continuous, second-order phase transition for all sufficiently large values of $K$ and a discontinuous, first-order phase transition for all sufficiently small values of $K$.

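We begin by recalling several definitions from Section 2. $\mathcal{P}(\Lambda)$ denotes the set of probability measures with support $\Lambda = \{-1, 0, 1\}$; $\rho$ denotes the measure $\frac{1}{3}(\delta_{-1} + \delta_0 + \delta_1) \in \mathcal{P}(\Lambda)$; for $\mu \in \mathcal{P}(\Lambda)$,

\[
R(\mu|\rho) = \sum_{i=-1}^{1} \mu_i \log(3\mu_i)
\]

denotes the relative entropy of $\mu$ with respect to $\rho$; and $f_K(\mu)$ is defined by

\[
f_K(\mu) = \int_\Lambda y^2\,\mu(dy) - K\Bigl(\int_\Lambda y\,\mu(dy)\Bigr)^2 = (\mu_1 + \mu_{-1}) - K(\mu_1 - \mu_{-1})^2.
\]

For $K > 0$, we also defined the set of microcanonical equilibrium macrostates by

\begin{align*}
\mathcal{E}^{u,K} &= \{\nu \in \mathcal{P}(\Lambda) : I^{u,K}(\nu) = 0\} \tag{4.1} \\
 &= \{\nu \in \mathcal{P}(\Lambda) : \nu \text{ minimizes } R(\nu|\rho) \text{ subject to } f_K(\nu) = u\}.
\end{align*}

$\mathcal{E}^{u,K}$ is well defined for $K > 0$ and $u \in \operatorname{dom} s_K = [\min(1 - K, 0), 1]$. Throughout this section we fix $u \in \operatorname{dom} s_K$; $s_K$ is defined in (2.4).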

Determining the elements in $\mathcal{E}^{u,K}$ requires solving a constrained minimization problem, which is the dual of the unconstrained minimization problem associated with the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates defined in (2.7). In order to simplify the analysis of the set $\mathcal{E}^{u,K}$, we employ the technique used in [2] to reduce the constrained minimization problem defining $\mathcal{E}^{u,K}$ to another minimization problem that is more easily studied. For fixed $K > 0$ and $u \in \operatorname{dom} s_K$, we define



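\[
\mathcal{D}_{u,K} = \{\mu \in \mathcal{P}(\Lambda) : f_K(\mu) = u\}. \tag{4.2}
\]

For $\mu \in \mathcal{D}_{u,K}$, let $z = \mu_1 - \mu_{-1}$ and $q = \mu_1 + \mu_{-1}$. Since $\mu \in \mathcal{D}_{u,K}$ implies that

\[
f_K(\mu) = q - K z^2 = u,
\]

we see that $q = u + K z^2$. Thus, for $\mu \in \mathcal{D}_{u,K}$, we have

\begin{align*}
R(\mu|\rho) &= \sum_{i=-1}^{1} \mu_i \log(3\mu_i) \\
 &= \frac{q - z}{2}\log\Bigl[\frac{3}{2}(q - z)\Bigr] + \frac{q + z}{2}\log\Bigl[\frac{3}{2}(q + z)\Bigr] + (1 - q)\log[3(1 - q)] \\
 &= \frac{q + z}{2}\log(q + z) + \frac{q - z}{2}\log(q - z) + (1 - q)\log(1 - q) - (q\log 2 - \log 3).
\end{align*}

Setting $q = u + K z^2$, we define the quantity

\[
R_{u,K}(z) = \frac{q + z}{2}\log(q + z) + \frac{q - z}{2}\log(q - z) + (1 - q)\log(1 - q) - (q\log 2 - \log 3) \tag{4.3}
\]

and the set

\[
\mathcal{M}_{u,K} = \{z \in \mathbb{R} : z = \mu_1 - \mu_{-1} \text{ for some } \mu \in \mathcal{D}_{u,K}\}. \tag{4.4}
\]

The derivation of $R_{u,K}$ makes it clear that $\mathcal{M}_{u,K} \subset (-1, 1)$ is the domain of $R_{u,K}$.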
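
The reduction from $R(\mu|\rho)$ to $R_{u,K}(z)$ can be verified numerically for any particular measure. The sketch below is only an illustration; the sample measure and the value of $K$ are arbitrary assumptions. It computes $u = f_K(\mu)$ and $z = \mu_1 - \mu_{-1}$ from the measure and compares the two expressions.

```python
import math

def relative_entropy_uniform(mu):
    """R(mu|rho) for rho uniform on {-1, 0, 1}; mu = (mu_{-1}, mu_0, mu_1)."""
    return sum(m * math.log(3.0 * m) for m in mu if m > 0.0)

def f_K(mu, K):
    """f_K(mu) = (mu_1 + mu_{-1}) - K * (mu_1 - mu_{-1})^2."""
    m_minus, _, m_plus = mu
    return (m_plus + m_minus) - K * (m_plus - m_minus) ** 2

def R_uK(z, u, K):
    """The reduced function R_{u,K}(z) of (4.3), with q = u + K*z^2."""
    q = u + K * z * z
    return (0.5 * (q + z) * math.log(q + z)
            + 0.5 * (q - z) * math.log(q - z)
            + (1.0 - q) * math.log(1.0 - q)
            - (q * math.log(2.0) - math.log(3.0)))

mu = (0.2, 0.5, 0.3)  # arbitrary sample measure (assumption)
K = 1.1               # arbitrary sample value (assumption)
u = f_K(mu, K)
z = mu[2] - mu[0]
print(relative_entropy_uniform(mu), R_uK(z, u, K))
```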

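We next introduce the set

\[
\tilde{\mathcal{E}}^{u,K} = \{\tilde z \in \mathcal{M}_{u,K} : \tilde z \text{ minimizes } R_{u,K}(z)\}.
\]

The following theorem states a one-to-one correspondence between the elements of $\tilde{\mathcal{E}}^{u,K}$ and $\mathcal{E}^{u,K}$ under an assumption on the structure of $\tilde{\mathcal{E}}^{u,K}$. In [15], for particular values of $u$ and $K$, numerical experiments show that $\tilde{\mathcal{E}}^{u,K}$ consists of either 1, 2 or 3 points. Although we are not able to prove that this is valid for all $u \in \operatorname{dom} s_K$ and $K > 0$, because of our numerical computations, we make it an assumption in the next theorem.

Theorem 4.1. Fix $K > 0$ and $u \in \operatorname{dom} s_K$. Suppose that $\tilde{\mathcal{E}}^{u,K} = \{\tilde z_\alpha\}_{\alpha=1}^r$, where $r$ equals 1, 2 or 3. Define $\nu_\alpha = \sum_{i=-1}^{1} \nu_{\alpha,i}\delta_i \in \mathcal{P}(\Lambda)$ by the formulas

\[
\nu_{\alpha,1} = \frac{K\tilde z_\alpha^2 + \tilde z_\alpha + u}{2}, \qquad
\nu_{\alpha,-1} = \frac{K\tilde z_\alpha^2 - \tilde z_\alpha + u}{2}, \qquad
\nu_{\alpha,0} = 1 - \nu_{\alpha,1} - \nu_{\alpha,-1}.
\]

Then $\mathcal{E}^{u,K}$ consists of the distinct elements $\nu_\alpha$, $\alpha = 1,\dots,r$.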

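Proof. Using the definition (4.2) of $\mathcal{D}_{u,K}$, we can rewrite the set $\mathcal{E}^{u,K}$ of microcanonical equilibrium macrostates defined in (4.1) as

\[
\mathcal{E}^{u,K} = \{\nu \in \mathcal{D}_{u,K} : \nu \text{ is a minimum point of } R(\mu|\rho)\}.
\]

We show that, for $\alpha = 1,\dots,r$, $f_K(\nu_\alpha) = u$ and $R(\nu_\alpha|\rho) < R(\mu|\rho)$ for all $\mu \in \mathcal{D}_{u,K}$ for which $\mu \ne \nu_\alpha$.

From the definition of $\nu_\alpha$, we have

\[
f_K(\nu_\alpha) = (\nu_{\alpha,1} + \nu_{\alpha,-1}) - K(\nu_{\alpha,1} - \nu_{\alpha,-1})^2 = (u + K\tilde z_\alpha^2) - K\tilde z_\alpha^2 = u.
\]

Therefore, $\nu_\alpha \in \mathcal{D}_{u,K}$ for all $\alpha = 1,\dots,r$. Since for all $\tilde z_\alpha, \tilde z_\ell \in \tilde{\mathcal{E}}^{u,K}$, $\alpha, \ell = 1,\dots,r$,

\[
R(\nu_\alpha|\rho) = R_{u,K}(\tilde z_\alpha) = R_{u,K}(\tilde z_\ell) = R(\nu_\ell|\rho),
\]

it follows that $R(\nu_\alpha|\rho)$ are equal for all $\alpha = 1,\dots,r$.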

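We now consider $\mu = \sum_{i=-1}^{1} \mu_i\delta_i \in \mathcal{D}_{u,K}$ such that $\mu \ne \nu_\alpha$ for all $\alpha = 1,\dots,r$. Defining $\zeta = \mu_1 - \mu_{-1}$, we claim that $\zeta \ne \tilde z_\alpha$ for all $\alpha = 1,\dots,r$. Suppose otherwise; that is, for some $\tilde z_\alpha$,

\[
\mu_1 - \mu_{-1} = \zeta = \tilde z_\alpha = \nu_{\alpha,1} - \nu_{\alpha,-1}. \tag{4.5}
\]

But $\mu \in \mathcal{D}_{u,K}$ implies that $f_K(\mu) = u = f_K(\nu_\alpha)$ and, thus, that

\[
\mu_1 + \mu_{-1} = \nu_{\alpha,1} + \nu_{\alpha,-1}. \tag{4.6}
\]

Combining (4.5) and (4.6) yields the contradiction that $\mu = \nu_\alpha$. Because $\zeta \ne \tilde z_\alpha$ for all $\alpha = 1,\dots,r$, it follows that $\zeta \notin \tilde{\mathcal{E}}^{u,K}$ and, thus, that $R_{u,K}(\tilde z_\alpha) < R_{u,K}(\zeta)$ for all $\alpha = 1,\dots,r$. As a result, for $\alpha = 1,\dots,r$, we have

\[
R(\nu_\alpha|\rho) = R_{u,K}(\tilde z_\alpha) < R_{u,K}(\zeta) = R(\mu|\rho).
\]

We complete the proof by showing that if $\tilde z_\alpha \ne \tilde z_\ell$, then $\nu_\alpha \ne \nu_\ell$. Indeed, if $\nu_\alpha = \nu_\ell$, then, for each choice of sign, we would have $K\tilde z_\alpha^2 \pm \tilde z_\alpha = K\tilde z_\ell^2 \pm \tilde z_\ell$. Since this leads to the contradiction that $\tilde z_\alpha = \tilde z_\ell$, the proof of the theorem is complete. □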
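
The map $\tilde z_\alpha \mapsto \nu_\alpha$ of Theorem 4.1 and the identity $f_K(\nu_\alpha) = u$ used in the proof can be illustrated as follows. This is a sketch only: the function names and sample values are assumptions, and the sample $z$ is an arbitrary point rather than an actual minimum point of $R_{u,K}$.

```python
def nu_from_spin(z, u, K):
    """Weights (nu_{-1}, nu_0, nu_1) given by the formulas of Theorem 4.1."""
    plus = 0.5 * (K * z * z + z + u)
    minus = 0.5 * (K * z * z - z + u)
    return (minus, 1.0 - plus - minus, plus)

def f_K(nu, K):
    """f_K(nu) = (nu_1 + nu_{-1}) - K * (nu_1 - nu_{-1})^2."""
    minus, _, plus = nu
    return (plus + minus) - K * (plus - minus) ** 2

u, K, z = 0.3, 1.1, 0.2  # illustrative values (assumption)
nu = nu_from_spin(z, u, K)
print(nu, f_K(nu, K))    # f_K(nu) recovers u; nu_1 - nu_{-1} recovers z
```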

Theorem 4.1 allows us to analyze the set $\mathcal{E}^{u,K}$ of microcanonical equilibrium macrostates by calculating the minimum points of the function $R_{u,K}$ defined in (4.3). Define

\[
\varphi_{u,K}(z) = \frac{q + z}{2}\log(q + z) + \frac{q - z}{2}\log(q - z) + (1 - q)\log(1 - q),
\]

where $q = u + K z^2$. With this notation (4.3) becomes

\[
R_{u,K}(z) = \varphi_{u,K}(z) - (u + K z^2)\log 2 + \log 3.
\]

This separation of $R_{u,K}$ into the nonlinear component $\varphi_{u,K}$ and the quadratic component is similar to the method used in Sections 3.2 and 3.3 in determining the elements in the set $\tilde{\mathcal{E}}_{\beta,K}$. There we separated the minimizing function $F_{\beta,K}(w)$ into a nonlinear component $c_\beta(w)$ and a quadratic component $w^2/(4\beta K)$; minimum points of $F_{\beta,K}$ satisfy $F_{\beta,K}'(w) = w/(2\beta K) - c_\beta'(w) = 0$. Solving this equation was greatly facilitated by understanding the concavity and convexity properties of $c_\beta'$, which are proved in Theorem 3.5.

Following the success of this method in studying the canonical ensemble, we apply a similar technique to determine the minimum points of $R_{u,K}$. We call a pair $(u, K)$ admissible if $u \in \operatorname{dom} s_K$. While an analytic proof could not be found, our numerical experiments show that there exists a curve $K = C(u)$ in the $(u, K)$-plane such that for all admissible $(u, K)$ lying above the graph of this curve, $\varphi_{u,K}'$ is strictly convex on its positive domain. The graph of $K = C(u)$ is depicted in Figure 9. We denote by $G_+$ the set of admissible $(u, K)$ lying above this graph and by $G_-$ the set of admissible $(u, K)$ lying below this graph.

Fig. 9. Bifurcation diagram for the BEG model with respect to the microcanonical ensemble.

Using a similar argument as in the proof of Theorem 3.6 for the canonical case, we are led to believe that, for all $(u, K) \in G_+$, the BEG model with respect to the microcanonical ensemble exhibits a continuous bifurcation in $K$; that is, there exists a critical value $K_c^{(2)}(u) > 0$ such that the following hold:

• For $0 < K \le K_c^{(2)}(u)$, $\tilde{\mathcal{E}}^{u,K} = \{0\}$.

• For $K > K_c^{(2)}(u)$, there exists a positive number $\tilde z(u, K)$ such that $\tilde{\mathcal{E}}^{u,K} = \{\pm\tilde z(u, K)\}$.

Combined with the one-to-one correspondence between the elements of $\tilde{\mathcal{E}}^{u,K}$ and $\mathcal{E}^{u,K}$ proved in Theorem 4.1, the structure of $\tilde{\mathcal{E}}^{u,K}$ just given yields a continuous bifurcation in $K$ exhibited by $\mathcal{E}^{u,K}$ for $(u, K)$ lying in the region $G_+$ above the graph of the curve $K = C(u)$. Similar to the definition of the critical value $K_c^{(2)}(\beta)$ given in (3.19) for the continuous bifurcation in $K$ exhibited by $\mathcal{E}_{\beta,K}$, the critical value $K_c^{(2)}(u)$ is the solution of the equation

\[
R_{u,K}''(0) = 0 \quad\text{or}\quad \varphi_{u,K}''(0) = 2K\log 2.
\]

Consequently, since $\varphi_{u,K}''(0) = 1/u + 2K\log(u/(1 - u))$, we define the second-order critical value to be

\[
K_c^{(2)}(u) = \frac{1}{2u\log(2(1 - u)/u)}. \tag{4.7}
\]

The derivation of this formula for the critical values $K_c^{(2)}(u)$ of the continuous bifurcation in $K$ exhibited by $\mathcal{E}^{u,K}$ rests on the existence of the curve $K = C(u)$, which in turn was derived numerically. However, the accuracy of (4.7) is supported by the fact that the graph of the curve $K_c^{(2)}(u)$ fits the critical values derived numerically in Figures 2 and 3 of [15].
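
The two ingredients of (4.7), the formula for $\varphi_{u,K}''(0)$ and the vanishing of $R_{u,K}''(0)$ at $K = K_c^{(2)}(u)$, can be checked with a central finite difference. The following is a minimal sketch; the step size, the sample value $u = 0.3$ and the function names are assumptions, and the formula for `K_c2` is only meaningful here for $0 < u < 2/3$, where the logarithm is positive.

```python
import math

def phi(z, u, K):
    """Nonlinear component phi_{u,K}(z), with q = u + K*z^2."""
    q = u + K * z * z
    return (0.5 * (q + z) * math.log(q + z)
            + 0.5 * (q - z) * math.log(q - z)
            + (1.0 - q) * math.log(1.0 - q))

def R(z, u, K):
    """R_{u,K}(z) = phi_{u,K}(z) - (u + K*z^2)*log 2 + log 3."""
    return phi(z, u, K) - (u + K * z * z) * math.log(2.0) + math.log(3.0)

def second_derivative_at_zero(func, u, K, h=1e-3):
    """Central finite-difference approximation of func''(0)."""
    return (func(h, u, K) - 2.0 * func(0.0, u, K) + func(-h, u, K)) / (h * h)

def K_c2(u):
    """Second-order critical value (4.7); assumes 0 < u < 2/3."""
    return 1.0 / (2.0 * u * math.log(2.0 * (1.0 - u) / u))

u = 0.3  # illustrative value (assumption)
closed_form = 1.0 / u + 2.0 * 1.0 * math.log(u / (1.0 - u))  # phi''(0) at K = 1
numeric = second_derivative_at_zero(phi, u, 1.0)
curvature_at_critical_K = second_derivative_at_zero(R, u, K_c2(u))
print(closed_form, numeric, curvature_at_critical_K)
```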

For values of $(u, K)$ lying in the region $G_-$ below the graph of the curve $K = C(u)$, the strict convexity behavior of $\varphi_{u,K}'$ no longer holds. Therefore, numerical computations were used to determine the behavior of $R_{u,K}$ for such $(u, K)$, showing a discontinuous bifurcation in $K$ in this region. Specifically, there exists a critical value $K_c^{(1)}(u)$ such that the following hold:

• For $0 < K < K_c^{(1)}(u)$, $\tilde{\mathcal{E}}^{u,K} = \{0\}$.

• For $K = K_c^{(1)}(u)$, there exists $\tilde z(u, K) > 0$ such that $\tilde{\mathcal{E}}^{u,K} = \{0, \pm\tilde z(u, K)\}$.

• For $K > K_c^{(1)}(u)$, there exists $\tilde z(u, K) > 0$ such that $\tilde{\mathcal{E}}^{u,K} = \{\pm\tilde z(u, K)\}$.

The critical values $K_c^{(1)}(u)$ were computed numerically by determining the value of $K$ for which the number of global minimum points of $R_{u,K}(z)$ changes from one at $z = 0$ to three at $z = 0$ and $z = \pm\tilde z(u, K)$, $\tilde z(u, K) > 0$.

The results of this section are summarized in the bifurcation diagram for the BEG model with respect to the microcanonical ensemble, which appears in Figure 9. In the next section we will see that Figure 9 is a phase diagram that describes the phase transition in the microcanonical ensemble as $u$ changes. In order to substantiate this, we have to transfer our analysis of $\mathcal{E}^{u,K}$ from fixed $u$ and varying $K$ to an analysis of $\mathcal{E}^{u,K}$ for fixed $K$ and varying $u$.

5. Comparison of phase diagrams for the two ensembles. We end our analysis of the canonical and microcanonical ensembles by explaining what our results imply concerning the nature of the phase transitions in the BEG model. These phase transitions are defined by varying $\beta$ and $u$, the two parameters that define the ensembles. As we will see, the order of the phase transitions is a structural property of the phase diagram in the sense that it is the same whether we vary $K$ or $\beta$ in the canonical ensemble and $K$ or $u$ in the microcanonical ensemble while keeping the other parameter fixed.

Before doing this, we first review one of the main contributions of the preceding two sections, which is to analyze the bifurcation behavior of the sets $\mathcal{E}_{\beta,K}$ and $\mathcal{E}^{u,K}$ of equilibrium macrostates with respect to both the canonical and microcanonical ensembles. Figure 8 summarizes the canonical analysis and Figure 9 the microcanonical analysis. The figures exhibit two different values of $K$ called tricritical values and denoted by $K_c^{\mathrm{canon}}$ and $K_c^{\mathrm{micro}}$. As we soon explain, at each of these values of $K$ the corresponding ensemble changes its behavior from a continuous, second-order phase transition to a discontinuous, first-order phase transition.

For the canonical ensemble, the tricritical value in Figure 8 is given by

\[
K_c^{\mathrm{canon}} = K_c^{(2)}(\beta_c) = K_c^{(2)}(\log 4) \approx 1.0820,
\]

where $K_c^{(2)}$ is defined in (3.19). With respect to the microcanonical ensemble, the tricritical value $K_c^{\mathrm{micro}}$ is the value of $K$ at which the curves $K = C(u)$ and $K = K_c^{(2)}(u)$ shown in Figure 9 intersect. From the numerical calculation of the curve $K = C(u)$, we obtain the following approximation for the tricritical value:

\[
K_c^{\mathrm{micro}} \approx 1.0813.
\]

These values of $K_c^{\mathrm{canon}}$ and $K_c^{\mathrm{micro}}$ agree with the values derived in [2] via a local analysis and numerical computations.
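
The canonical tricritical value can be reproduced directly. The sketch below assumes the form $K_c^{(2)}(\beta) = 1/(2\beta c_\beta''(0))$, where $c_\beta''(0) = 2e^{-\beta}/(1 + 2e^{-\beta})$ is the variance of $\rho_\beta$; the function name is hypothetical. At $\beta = \log 4$ this gives $3/(4\log 2)$.

```python
import math

def K_c2_canonical(beta):
    """K_c^(2)(beta) = 1 / (2*beta*c_beta''(0)); c_beta''(0) is the variance of rho_beta."""
    a = math.exp(-beta)
    variance = 2.0 * a / (1.0 + 2.0 * a)
    return 1.0 / (2.0 * beta * variance)

K_canon = K_c2_canonical(math.log(4.0))
print(round(K_canon, 4))
```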

We first illustrate how our analysis of $\mathcal{E}_{\beta,K}$ in Theorems 3.1 and 3.2 for fixed $\beta$ and varying $K$ yields a continuous, second-order phase transition and a discontinuous, first-order phase transition with respect to the canonical ensemble. These phase transitions are defined for fixed $K$ and varying $\beta$, the thermodynamic parameter that defines the ensemble. In order to study the phase transition, we must therefore transform the analysis of $\mathcal{E}_{\beta,K}$ for fixed $\beta$ and varying $K$ to an analysis of the same set for fixed $K$ and varying $\beta$. After we consider the microcanonical phase transition in an analogous way, we will focus on the region

\[
K_c^{\mathrm{micro}} \approx 1.0813 < K < 1.0820 \approx K_c^{\mathrm{canon}}.
\]

As we will point out, the fact that for $K$ in this region the two ensembles exhibit different phase transition behavior (discontinuous for the canonical and continuous for the microcanonical) is closely related to the phenomenon of ensemble nonequivalence in the model.

We begin with the continuous phase transition for the canonical ensemble. Figure 8 exhibits a monotonically decreasing function $K = K_c^{(2)}(\beta)$ for $0 < \beta < \beta_c = \log 4$. Inverting this function yields a monotonically decreasing function $\beta = \beta_c^{(2)}(K)$ for $K > K_c^{\mathrm{canon}} = K_c^{(2)}(\beta_c) \approx 1.0820$. Consider, for fixed $K > K_c^{\mathrm{canon}}$ and small $\delta > 0$, values of $\beta \in (\beta_c^{(2)}(K) - \delta, \beta_c^{(2)}(K) + \delta)$. Our analysis of $\mathcal{E}_{\beta,K}$ in Theorem 3.1 shows the following:

• For $\beta \in (\beta_c^{(2)}(K) - \delta, \beta_c^{(2)}(K)]$, the model exhibits a single phase $\rho_\beta$.

• For $\beta \in (\beta_c^{(2)}(K), \beta_c^{(2)}(K) + \delta)$, the model exhibits two distinct phases $\nu^+(\beta,K)$ and $\nu^-(\beta,K)$.

We claim that, for fixed $K > K_c^{\mathrm{canon}}$, this is a second-order phase transition; that is, as $\beta \to (\beta_c^{(2)}(K))^+$, we have $\nu^+(\beta,K) \to \rho_\beta$ and $\nu^-(\beta,K) \to \rho_\beta$. To see this, we recall from Figure 1(b) that, for $\beta = \beta_c^{(2)}(K)$, the graph of the linear component $w/(2\beta K)$ of $F_{\beta,K}'(w)$ is tangent to the graph of the nonlinear component $c_\beta'(w)$ of $F_{\beta,K}'(w)$ at the origin. This figure was referred to in Section 3.1 when we analyzed the structure of the set $\tilde{\mathcal{E}}_{\beta,K}$ (Theorem 3.6). Since both components of $F_{\beta,K}'(w)$ are continuous with respect to $\beta$, a perturbation in $\beta$ yields a continuous phase transition in $\tilde{\mathcal{E}}_{\beta,K}$ and thus in $\mathcal{E}_{\beta,K}$. A similar argument shows that each of the double phases $\nu^+(\beta,K)$ and $\nu^-(\beta,K)$ is a continuous function of $\beta$ for $\beta > \beta_c^{(2)}(K)$.

We now analyze the discontinuous phase transition for the canonical ensemble in a similar way. Figure 8 exhibits a monotonically decreasing function $K = K_c^{(1)}(\beta)$ for $\beta > \beta_c = \log 4$. Inverting this function yields a monotonically decreasing function $\beta = \beta_c^{(1)}(K)$ for $0 < K < K_c^{\mathrm{canon}} \approx 1.0820$. For fixed $0 < K < K_c^{\mathrm{canon}}$ and small $\delta > 0$, consider values of $\beta \in (\beta_c^{(1)}(K) - \delta, \beta_c^{(1)}(K) + \delta)$. Our analysis of $\mathcal{E}_{\beta,K}$ in Theorem 3.2 shows the following:

• For $\beta \in (\beta_c^{(1)}(K) - \delta, \beta_c^{(1)}(K))$, the model exhibits a single phase $\rho_\beta$.

• For $\beta = \beta_c^{(1)}(K)$, the model exhibits three distinct phases $\rho_\beta$, $\nu^+(\beta,K)$ and $\nu^-(\beta,K)$.

• For $\beta \in (\beta_c^{(1)}(K), \beta_c^{(1)}(K) + \delta)$, the model exhibits two distinct phases $\nu^+(\beta,K)$ and $\nu^-(\beta,K)$.

We claim that, for fixed $0 < K < K_c^{\mathrm{canon}}$, this is a first-order phase transition; that is, as $\beta \to (\beta_c^{(1)}(K))^+$, we have, for each choice of sign, $\nu^\pm(\beta,K) \to \nu^\pm(\beta_c^{(1)}(K),K) \ne \rho_\beta$. To see this, we recall from Figure 7(a) that, for $\beta = \beta_c^{(1)}(K)$, the graph of the linear component $w/(2\beta K)$ of $F_{\beta,K}'(w)$ intersects the graph of the nonlinear component $c_\beta'(w)$ of $F_{\beta,K}'(w)$ in five places such that the signed area between the two graphs is 0. This results in three values of $w$ that are global minimum points of $F_{\beta,K}$; namely, $w = 0$, $\tilde w(\beta,K)$, $-\tilde w(\beta,K)$ (Theorem 3.8). These three values of $w$ give rise to three values of $z = w/(2\beta K)$ that constitute the set $\tilde{\mathcal{E}}_{\beta,K}$ for $\beta = \beta_c^{(1)}(K)$. Since both components of $F_{\beta,K}'(w)$ are continuous with respect to $\beta$, a perturbation in $\beta$ yields a discontinuous phase transition in $\tilde{\mathcal{E}}_{\beta,K}$ and thus in $\mathcal{E}_{\beta,K}$. A similar argument shows that each of the equilibrium macrostates $\nu^+(\beta,K)$ and $\nu^-(\beta,K)$ is a continuous function of $\beta$ for $\beta > \beta_c^{(1)}(K)$.

The phase transitions for the microcanonical ensemble are defined for fixed $K$ and varying $u$, the thermodynamic parameter defining the ensemble. Therefore, in order to study these phase transitions, we must transform the analysis of $\mathcal{E}^{u,K}$ done in Section 4 for fixed $u$ and varying $K$ to an analysis of the same set for fixed $K$ and varying $u$. This is carried out in a way that is similar to what we have just done for the canonical ensemble. In particular, we find that, for $K > K_c^{\mathrm{micro}} \approx 1.0813$, the BEG model with respect to the microcanonical ensemble exhibits a continuous, second-order phase transition and that, for $0 < K < K_c^{\mathrm{micro}}$, the model exhibits a discontinuous, first-order phase transition.

We now focus on values of $K$ satisfying $K_c^{\mathrm{micro}} < K < K_c^{\mathrm{canon}}$. As we have just seen, for such $K$, the two ensembles exhibit different phase transition behavior: for $K_c^{\mathrm{micro}} < K$, the microcanonical ensemble undergoes a continuous, second-order phase transition, while for $0 < K < K_c^{\mathrm{canon}}$, the canonical ensemble undergoes a discontinuous, first-order phase transition. This observation is consistent with a numerical calculation given in Figure 10 showing that, for a fixed value of $K \in (K_c^{\mathrm{micro}}, K_c^{\mathrm{canon}})$, there exists a subset of the microcanonical equilibrium macrostates that are not realized canonically [15]. As a result, for this value of $K$, the two ensembles are nonequivalent at the level of equilibrium macrostates.

Figures 10(a) and 10(b) exhibit, for a range of values of $u$ and $\beta$, the structure of the set $\mathcal{E}^{u,K}$ of microcanonical equilibrium macrostates and the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates for $K = 1.0817$. This value of $K$ lies in the interval $(K_c^{\mathrm{micro}}, K_c^{\mathrm{canon}}) \approx (1.0813, 1.0820)$. Each equilibrium macrostate in $\mathcal{E}^{u,K}$ and $\mathcal{E}_{\beta,K}$ is an empirical measure having the form

\[
\nu = \nu_{-1}\delta_{-1} + \nu_0\delta_0 + \nu_1\delta_1.
\]

In both figures the solid and dashed curves can be taken to represent the components $\nu_1$ and $\nu_{-1}$. The components $\nu_1$ and $\nu_{-1}$ in the microcanonical ensemble are functions of $u$ [Figure 10(a)] and in the canonical ensemble are functions of $\beta$ [Figure 10(b)]. Figures 10(a) and 10(b) were taken from [15].

Comparing the two figures reveals that the ensembles are nonequivalent for this value of $K$. Specifically, because of the discontinuous, first-order phase transition in the canonical ensemble, there exists a subset of $\mathcal{P}(\Lambda)$ that is not realized by $\mathcal{E}_{\beta,K}$ for any $\beta > 0$. On the other hand, since the set $\mathcal{E}^{u,K}$ of microcanonical equilibrium macrostates exhibits a continuous, second-order phase transition, the subset of $\mathcal{P}(\Lambda)$ not realized canonically is realized microcanonically. As a result, there exists a nonequivalence of ensembles at the level of equilibrium macrostates. The reader is referred to [15] for a more complete analysis of ensemble equivalence and nonequivalence for the BEG model.

6. Limit theorems for the total spin with respect to $P_{n,\beta,K}$. In Section 3.1 we rewrote the canonical ensemble $P_{n,\beta,K}$ for the BEG model in terms of the total spin $S_n$. This allowed us to reduce the analysis of the set $\mathcal{E}_{\beta,K}$ of canonical equilibrium macrostates to that of a Curie-Weiss-type model. We end this paper by deriving limit theorems for the $P_{n,\beta,K}$-distributions of appropriately scaled partial sums $S_n = \sum_{j=1}^n \omega_j$, which represents the total spin in the model. Since $S_n/n = \int_\Lambda y\,L_n(dy)$, the limit theorems for $S_n$ are also limit theorems for the empirical measures $L_n$. As we will see, the new limit theorems follow from those for the Curie-Weiss model proved in [12, 14].

Fig. 10. Structure of (a) the set $\mathcal{E}^{u,K}$ and (b) the set $\mathcal{E}_{\beta,K}$ for $K = 1.0817$.

Let $\tau$ be a Borel probability measure on $\mathbb{R}$ satisfying $\int_{\mathbb{R}} \exp[bx^2]\,\tau(dx) < \infty$ for all $b > 0$. The Curie-Weiss model considered in [12, 14] is defined in terms of a canonical ensemble on $(\mathbb{R}^n, \mathcal{B}_{\mathbb{R}^n})$ given by

\[
P_{n,\beta}^\tau(d\omega) = \frac{1}{Z_n^\tau(\beta)} \exp\Bigl[\frac{n\beta}{2}\Bigl(\frac{S_n(\omega)}{n}\Bigr)^2\Bigr] P_n^\tau(d\omega). \tag{6.1}
\]

In this formula $\beta > 0$, $P_n^\tau$ is the product measure on $\mathbb{R}^n$ with identical one-dimensional marginals $\tau$, and $Z_n^\tau(\beta)$ is a normalization making $P_{n,\beta}^\tau$ a probability measure. The canonical ensemble for the BEG model is defined by the measure $P_{n,\beta,K}$ in (2.1), which is re-expressed in (3.2) as a Curie-Weiss-type measure. This measure has the form (6.1), in which $\beta$ is replaced by $2\beta K$ and $\tau$ equals the measure $\rho_\beta$ defined in (3.1).

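For $t \in \mathbb{R}$, define $c_\tau(t) = \log\int_{\mathbb{R}} \exp(t\omega_1)\,\tau(d\omega_1)$. As shown in [12, 14], the $P_{n,\beta}^\tau$-limits for $S_n$ are determined by the global minimum points of the function

\[
G_\beta^\tau(z) = \tfrac{1}{2}\beta z^2 - c_\tau(\beta z). \tag{6.2}
\]

Let $\tilde z$ be a global minimum point of $G_\beta^\tau$. Since $G_\beta^\tau$ is real analytic, there exists a positive integer $r = r(\tilde z)$ such that $(G_\beta^\tau)^{(2r)}(\tilde z) > 0$ and

\[
G_\beta^\tau(z) = G_\beta^\tau(\tilde z) + \frac{(G_\beta^\tau)^{(2r)}(\tilde z)}{(2r)!}(z - \tilde z)^{2r} + O\bigl((z - \tilde z)^{2r+1}\bigr) \quad\text{as } z \to \tilde z.
\]

We call $r(\tilde z)$ the type of the minimum point $\tilde z$. If $r = 1$, then $(G_\beta^\tau)''(\tilde z) = \beta - \beta^2 (c_\tau)''(\beta\tilde z)$, and if $r \ge 2$, then $(G_\beta^\tau)^{(2r)}(\tilde z) = -\beta^{2r}(c_\tau)^{(2r)}(\beta\tilde z)$.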

The canonical ensemble $P_{n,\beta,K}$ for the BEG model has the form of the Curie-Weiss measure $P_{n,\beta}^\tau$ with $\beta$ replaced by $2\beta K$ and $\tau = \rho_\beta$. Therefore, the function that plays the role of $G_\beta^\tau$ for the BEG model is $G_{2\beta K}^{\rho_\beta}$. This coincides with the function

\[
G_{\beta,K}(z) = \beta K z^2 - c_\beta(2\beta K z) = \beta K z^2 - \log\int_{\{-1,0,1\}} \exp(2\beta K z\,\omega_1)\,\rho_\beta(d\omega_1)
\]

defined in (3.8). For $0 < \beta \le \beta_c$ and $K > 0$, detailed information about the set $\tilde{\mathcal{E}}_{\beta,K}$ of global minimum points of $G_{\beta,K}$ is given in Theorem 3.6; for $\beta > \beta_c$ and $K > 0$, detailed information about $\tilde{\mathcal{E}}_{\beta,K}$ is given in Theorem 3.8.

We next indicate the form of the limit theorems for the Curie-Weiss model, restricting to those cases that arise in the BEG model. The first, Theorem 6.1, states limits that are valid when $G_\beta^\tau$ has a unique global minimum point at $z = 0$. The second, Theorem 6.2, states a conditioned limit that is valid when $G_\beta^\tau$ has multiple global minimum points all of type 1. A law of large numbers for $S_n/n$ is given in part (a) of Theorem 6.1. In part (b) $f_{0,\sigma^2(\beta)}$ denotes the density of a $N(0, \sigma^2(\beta))$ random variable with

\[
\sigma^2(\beta) = \frac{\beta\cdot(c_\tau)''(0)}{(G_\beta^\tau)''(0)}. \tag{6.3}
\]

When the type of the minimum point at 0 is $r = 1$, $\sigma^2(\beta) > 0$ because, in this case, $(G_\beta^\tau)''(0) > 0$ and, in general, $(c_\tau)''(0) > 0$. If $f$ is a nonnegative, integrable function on $\mathbb{R}$, then, for $r \in \mathbb{N}$, we write

\[
P_{n,\beta}^\tau\{S_n/n^{1-1/2r} \in dx\} \Longrightarrow f(x)\,dx
\]

to mean that, as $n \to \infty$, the $P_{n,\beta}^\tau$-distributions of $S_n/n^{1-1/2r}$ converge weakly to a distribution having a density proportional to $f$. When $r = 1$, $f = f_{0,\sigma^2(\beta)}$, and the limit is a central-limit-type theorem with scaling $n^{1/2}$. When $r \ge 2$, the limits involve the nonclassical scaling $n^{1-1/2r}$, and the $P_{n,\beta}^\tau$-distributions of the scaled random variables converge weakly to a distribution having a density proportional to $\exp[-\mathrm{const}\cdot x^{2r}]$. Theorem 6.1 is proved in Theorem 2.1 in [12] for $\beta = 1$; rescaling yields the more general form given here.

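Theorem 6.1. Consider the Curie-Weiss model, for which the canonical ensemble $P_{n,\beta}^\tau$ is defined by (6.1). For $\beta > 0$, assume that $G_\beta^\tau$ has a unique global minimum point at $z = 0$ having type $r$. Let $f_{0,\sigma^2(\beta)}$ be the density of a $N(0, \sigma^2(\beta))$ random variable, where $\sigma^2(\beta)$ is the positive quantity defined in (6.3). The following conclusions hold:

(a) $P_{n,\beta}^\tau\{S_n/n \in dx\} \Longrightarrow \delta_0$ as $n \to \infty$.

(b) As $n \to \infty$,

\[
P_{n,\beta}^\tau\Bigl\{\frac{S_n}{n^{1-1/2r}} \in dx\Bigr\} \Longrightarrow
\begin{cases}
f_{0,\sigma^2(\beta)}(x)\,dx, & \text{for } r = 1, \\
\exp\bigl(-(G_\beta^\tau)^{(2r)}(0)\cdot x^{2r}/(2r)!\bigr)\,dx, & \text{for } r \ge 2.
\end{cases}
\]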

The next theorem is valid when $G_\beta^\tau$ has multiple global minimum points all of type 1. Part (a), proved in Theorem 3.8 in [12], states a law of large numbers for $S_n/n$. Part (b), proved in Theorem 2.4 in [14], states a conditioned limit. For each global minimum point $\tilde z$ of type 1, we define the positive quantity

\[
\sigma^2(\beta, \tilde z) = \frac{\beta\cdot(c_\tau)''(\beta\tilde z)}{(G_\beta^\tau)''(\tilde z)}. \tag{6.4}
\]

Theorem 6.2. Consider the Curie-Weiss model, for which the canonical ensemble $P_{n,\beta}^\tau$ is defined by (6.1). For $\beta > 0$, assume that $G_\beta^\tau$ has global minimum points, all of type 1, at $\{\tilde z_1, \dots, \tilde z_m\}$ for $m \ge 2$. For each $j = 1, \dots, m$, we define

_ a\P,z,) 
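where $\sigma^2(\beta, \tilde z_j)$ is the positive quantity defined in (6.4). Let $f_{0,\sigma^2(\beta,\tilde z_j)}$ be the density of a $N(0, \sigma^2(\beta, \tilde z_j))$ random variable. The following conclusions hold:

(a) $P_{n,\beta}^\tau\{S_n/n \in dx\} \Longrightarrow \sum_{j=1}^m \bar b_j \delta_{\tilde z_j}$ as $n \to \infty$.

(b) There exists $A = A(\tilde z_j) > 0$ such that, for any $a \in (0, A)$,

\[
P_{n,\beta}^\tau\Bigl\{\frac{S_n - n\tilde z_j}{n^{1/2}} \in dx \,\Big|\, \frac{S_n}{n} \in [\tilde z_j - a, \tilde z_j + a]\Bigr\} \Longrightarrow f_{0,\sigma^2(\beta,\tilde z_j)}(x)\,dx \quad\text{as } n \to \infty.
\]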

In order to adapt these limit theorems to the BEG model, we now classify each of the points in $\tilde{\mathcal{E}}_{\beta,K}$ by type. $\tilde{\mathcal{E}}_{\beta,K}$ denotes the set of global minimum points of $G_{\beta,K} = G_{2\beta K}^{\rho_\beta}$, which plays the same role for the BEG model as $G_\beta^\tau$ for the Curie-Weiss model. The classification of the points in $\tilde{\mathcal{E}}_{\beta,K}$ by type is done in Theorem 6.3 for $0 < \beta \le \beta_c$ and $K > 0$, in which case $\tilde{\mathcal{E}}_{\beta,K}$ exhibits a continuous bifurcation, and in Theorem 6.4 for $\beta > \beta_c$ and $K > 0$, in which case $\tilde{\mathcal{E}}_{\beta,K}$ exhibits a discontinuous bifurcation. The associated limit theorems are given in Theorems 6.5 and 6.6. Except when $K = K_c^{(2)}(\beta)$ [Theorem 6.3(b)], the type of each of the global minimum points is 1. In these cases, the associated limit theorems are central-limit-type theorems with scalings $n^{1/2}$. When $K = K_c^{(2)}(\beta)$, we have $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$, and the type of the minimum point at 0 is $r = 2$ or $r = 3$, depending on whether $0 < \beta < \beta_c$ or $\beta = \beta_c$. The associated limit theorems have noncentral-limit scalings $n^{3/4}$ or $n^{5/6}$, and in each case

\[
P_{n,\beta,K}\{S_n/n^{1-1/2r} \in dx\} \Longrightarrow \mathrm{const}\cdot\exp[-\mathrm{const}\cdot x^{2r}]\,dx.
\]

These nonclassical limit theorems signal the onset of a phase transition ([9], Section V.8). As $K$ increases through $K_c^{(2)}(\beta)$, the global minimum point at 0 bifurcates continuously into symmetric, nonzero global minimum points $\pm\tilde z(\beta,K)$.
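
The scaling exponents quoted above follow from the general rule that a minimum point of type $r$ is associated with the scaling $n^{1-1/2r}$. A small sketch with exact rational arithmetic (the function name is hypothetical) makes the three cases explicit:

```python
from fractions import Fraction

def scaling_exponent(r):
    """Exponent of n in the scaling n^(1 - 1/(2r)) for a minimum point of type r."""
    return 1 - Fraction(1, 2 * r)

# type 1: central-limit scaling; types 2 and 3: nonclassical scalings
exponents = {r: scaling_exponent(r) for r in (1, 2, 3)}
print(exponents)
```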

We first consider $0 < \beta \le \beta_c = \log 4$. According to Theorem 3.6, there exists a critical value

\[
K_c^{(2)}(\beta) = \frac{1}{2\beta c_\beta''(0)} = \frac{1}{4\beta e^{-\beta}} + \frac{1}{2\beta} \tag{6.5}
\]

with the following properties:

• For $0 < K \le K_c^{(2)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$.

• For $K > K_c^{(2)}(\beta)$, there exists $\tilde z(\beta,K) > 0$ such that $\tilde{\mathcal{E}}_{\beta,K} = \{\pm\tilde z(\beta,K)\}$.

The next theorem gives the type of each of these points in $\tilde{\mathcal{E}}_{\beta,K}$. The type is always 1 except when $K = K_c^{(2)}(\beta)$; in this case the global minimum point at 0 has type $r = 2$ if $0 < \beta < \beta_c$ and type $r = 3$ if $\beta = \beta_c$.

Theorem 6.3. Consider the BEG model, for which the canonical ensemble is given by (3.2). Let $0 < \beta \le \beta_c = \log 4$ and define $K_c^{(2)}(\beta)$ by (6.5). The following conclusions hold:

(a) For $0 < K < K_c^{(2)}(\beta)$, $\tilde z = 0$ has type $r = 1$.

(b) Let $K = K_c^{(2)}(\beta)$.

(i) For $\beta < \beta_c$, $\tilde z = 0$ has type $r = 2$.

(ii) For $\beta = \beta_c$, $\tilde z = 0$ has type $r = 3$.

(c) For $K > K_c^{(2)}(\beta)$ and each choice of sign, $\tilde z = \pm\tilde z(\beta,K)$ has type $r = 1$.

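Proof. (a) By (6.5), we have

$$G''_{\beta,K}(0) = 2\beta K\bigl(1 - 2\beta K c_\beta''(0)\bigr) = 2\beta K\left(1 - \frac{K}{K_c^{(2)}(\beta)}\right).$$

Therefore, $0 < K < K_c^{(2)}(\beta)$ implies that $G''_{\beta,K}(0) > 0$ and, thus, that $z = 0$ has type $r = 1$.

(b) For $K = K_c^{(2)}(\beta)$, $G''_{\beta,K}(0) = 0$. A simple calculation yields

$$G^{(4)}_{\beta,K}(0) = -(2\beta K)^4 c_\beta^{(4)}(0) = -(2\beta K)^4\,\frac{2e^{-\beta}(1 + 2e^{-\beta})(1 - 2e^{-\beta} - 8e^{-2\beta})}{(1 + 2e^{-\beta})^4}. \tag{6.6}$$

Since $1 - 2e^{-\beta} - 8e^{-2\beta} = (1 - 4e^{-\beta})(1 + 2e^{-\beta})$, this factor is negative for $\beta < \log 4$ and vanishes at $\beta = \log 4$. Therefore, for $\beta < \beta_c$, $G^{(4)}_{\beta,K}(0) > 0$ and for $\beta = \beta_c$, $G^{(4)}_{\beta,K}(0) = 0$. Computing the sixth derivative in the case $\beta = \beta_c$ yields

$$G^{(6)}_{\beta,K}(0) = 2\cdot 3^4. \tag{6.7}$$

As a result, $z = 0$ has type 2 if $\beta < \beta_c$ and has type 3 if $\beta = \beta_c$.

(c) Lemma 3.7 states the existence and uniqueness of nonzero global minimum points $\pm\tilde{w}(\beta,K)$ of

$$F_{\beta,K}(w) = w^2/(4\beta K) - c_\beta(w) = G_{\beta,K}(w/(2\beta K)).$$

According to part (a) of the lemma, $F''_{\beta,K}(\tilde{w}(\beta,K)) > 0$. Since $\tilde{z}(\beta,K) = \tilde{w}(\beta,K)/(2\beta K)$, $F''_{\beta,K}(\tilde{w}(\beta,K)) > 0$ implies $G''_{\beta,K}(\tilde{z}(\beta,K)) > 0$. The symmetry of $G_{\beta,K}$ allows us to conclude that, for each choice of sign, $\pm\tilde{z}(\beta,K)$ has type $r = 1$. This completes the proof. □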
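The derivative computations in parts (a) and (b) can be checked numerically. The sketch below assumes $G_{\beta,K}(z) = \beta K z^2 - c_\beta(2\beta K z)$ and that $c_\beta$ is the cumulant generating function of the measure with weights proportional to $(e^{-\beta}, 1, e^{-\beta})$ on $\{-1, 0, 1\}$. Under that assumption the derivatives of $c_\beta$ at 0 are cumulants of a symmetric measure whose even moments all equal $m = 2e^{-\beta}/(1+2e^{-\beta})$. The function names are hypothetical.

```python
import math

def K_c2(beta):
    """Assumed closed form of 1/(2*beta*c_beta''(0))."""
    return (math.exp(beta) + 2.0) / (4.0 * beta)

def derivatives_at_zero(beta, K):
    """Return (G''(0), G^(4)(0), G^(6)(0)) for G(z) = beta*K*z^2 - c_beta(2*beta*K*z)."""
    m = 2.0 * math.exp(-beta) / (1.0 + 2.0 * math.exp(-beta))
    k2 = m                                # second cumulant, c''(0)
    k4 = m - 3.0 * m**2                   # fourth cumulant, c^(4)(0)
    k6 = m - 15.0 * m**2 + 30.0 * m**3    # sixth cumulant, c^(6)(0)
    s = 2.0 * beta * K
    return s - s**2 * k2, -(s**4) * k4, -(s**6) * k6

def type_at_zero(beta, K, tol=1e-9):
    """Type r of z = 0: the first nonvanishing even derivative G^(2r)(0) must be positive.
    Returns None when that derivative is negative (0 is then not a minimum)."""
    for r, d in enumerate(derivatives_at_zero(beta, K), start=1):
        if abs(d) > tol:
            return r if d > 0 else None
    return None

beta_c = math.log(4.0)
print(type_at_zero(1.0, 0.5))                    # K below the critical value
print(type_at_zero(1.0, K_c2(1.0)))              # K critical, beta < beta_c
print(type_at_zero(beta_c, K_c2(beta_c)))        # K critical, beta = beta_c
print(derivatives_at_zero(beta_c, K_c2(beta_c))[2])  # compare with 2 * 3**4
```

The tolerance `tol` only absorbs floating-point error at the critical parameters. The function tests local behavior at 0 and does not check that 0 is a global minimum.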



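We next classify by type the points in $\tilde{\mathcal{E}}_{\beta,K}$ for $\beta > \beta_c$ and $K > 0$. According to Theorem 3.8, there exists a critical value $K_c^{(1)}(\beta)$ with the following properties:

• For $0 < K < K_c^{(1)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$.

• For $K = K_c^{(1)}(\beta)$, there exists $\tilde{z}(\beta,K) > 0$ such that $\tilde{\mathcal{E}}_{\beta,K} = \{0, \pm\tilde{z}(\beta,K)\}$.

• For $K > K_c^{(1)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K} = \{\pm\tilde{z}(\beta,K)\}$.

The next theorem shows that the type of each of these points in $\tilde{\mathcal{E}}_{\beta,K}$ is 1.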

Theorem 6.4. Consider the BEG model, for which the canonical ensemble is given by (3.2). Let $\beta > \beta_c$ and $K > 0$. The points in $\tilde{\mathcal{E}}_{\beta,K}$ all have type $r = 1$.



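Proof. We first assume that $0 \in \tilde{\mathcal{E}}_{\beta,K}$, in which case $0 < K \le K_c^{(1)}(\beta)$. Define $K_2 = 1/(2\beta c_\beta''(0))$. According to Theorem 3.8, we have $K_c^{(1)}(\beta) < K_2$. Since

$$G''_{\beta,K}(0) = 2\beta K\left(1 - \frac{K}{K_2}\right), \tag{6.8}$$

it follows that, whenever $0 < K \le K_c^{(1)}(\beta)$, $1 > K/K_2$ and, thus, $G''_{\beta,K}(0) > 0$. We conclude that the global minimum point of $G_{\beta,K}$ at $z = 0$ has type $r = 1$, as claimed.

For $K \ge K_c^{(1)}(\beta)$, $\tilde{\mathcal{E}}_{\beta,K}$ also contains the symmetric, nonzero minimum points $\pm\tilde{z}(\beta,K)$ of $G_{\beta,K}$. Lemma 3.10 states the existence and uniqueness of nonzero global minimum points $\pm\tilde{w}(\beta,K)$ of

$$F_{\beta,K}(w) = w^2/(4\beta K) - c_\beta(w) = G_{\beta,K}(w/(2\beta K)).$$

Furthermore, according to part (a) of the lemma, $F''_{\beta,K}(\tilde{w}(\beta,K)) > 0$. Since $\tilde{z}(\beta,K) = \tilde{w}(\beta,K)/(2\beta K)$, $F''_{\beta,K}(\tilde{w}(\beta,K)) > 0$ implies $G''_{\beta,K}(\tilde{z}(\beta,K)) > 0$. The symmetry of $G_{\beta,K}$ allows us to conclude that, for each choice of sign, $\pm\tilde{z}(\beta,K)$ has type $r = 1$. This completes the proof. □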
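A numerical illustration of the discontinuous case may help here. The sketch assumes $G_{\beta,K}(z) = \beta K z^2 - c_\beta(2\beta K z)$ with $c_\beta(t) = \log[(1 + 2e^{-\beta}\cosh t)/3]$, so that $K_2 = (e^\beta + 2)/(4\beta)$. The grid search and the sample values of $K$ are illustrative choices, not taken from the paper.

```python
import math

def G(z, beta, K):
    """Assumed form G_{beta,K}(z) = beta*K*z^2 - log((1 + 2e^-beta cosh(2*beta*K*z)) / 3)."""
    t = 2.0 * beta * K * z
    return beta * K * z * z - math.log((1.0 + 2.0 * math.exp(-beta) * math.cosh(t)) / 3.0)

def global_minimizer(beta, K, steps=2000):
    """Grid minimizer of G on [0, 1]; by symmetry -z is a minimizer too."""
    grid = [i / steps for i in range(steps + 1)]
    return min(grid, key=lambda z: G(z, beta, K))

beta = 2.0                                    # larger than beta_c = log 4
K2 = (math.exp(beta) + 2.0) / (4.0 * beta)    # where G''(0) changes sign
for K in (1.00, 1.05, 1.10, 1.15):
    curvature = 2.0 * beta * K * (1.0 - K / K2)   # G''(0), as in (6.8)
    z = global_minimizer(beta, K)
    print(f"K = {K:.2f}  G''(0) = {curvature:.3f}  minimizer z = {z:.3f}")
```

For these parameters $G''_{\beta,K}(0)$ stays positive throughout, since every $K$ used is below $K_2$. The global minimizer nevertheless jumps from 0 to a value far from 0, so the minimum at 0 keeps type 1 up to the transition, unlike the continuous case.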

Theorems 6.1 and 6.2, together with the classification by type of the global minimum points of $G_{\beta,K}$, yield limit theorems for the $P_{n,\beta,K}$-distributions for appropriately scaled partial sums $S_n$ for the BEG model. The first, Theorem 6.5, states limits that are valid when $G_{\beta,K}$ has a unique global minimum point at $z = 0$. This is the case for $0 < \beta \le \beta_c$, $0 < K \le K_c^{(2)}(\beta)$ [Theorem 3.6(a)] and for $\beta > \beta_c$, $0 < K < K_c^{(1)}(\beta)$ [Theorem 3.8(a)]. The second, Theorem 6.6, states a law of large numbers and a conditioned limit that are valid when $G_{\beta,K}$ has multiple global minimum points.

In Theorem 6.5, $f_{0,\sigma^2(\beta,K)}$ denotes the density of an $N(0,\sigma^2(\beta,K))$ random variable with

$$\sigma^2(\beta,K) = \frac{2\beta K\cdot c_\beta''(0)}{G''_{\beta,K}(0)}. \tag{6.9}$$

When the type of the global minimum point at 0 is $r = 1$, $\sigma^2(\beta,K) > 0$.

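Theorem 6.5. Consider the BEG model, for which the canonical ensemble $P_{n,\beta,K}$ is given by (3.2). Suppose that $\tilde{\mathcal{E}}_{\beta,K} = \{0\}$ and let $r$ be the type of the point $z = 0$ as given in Theorems 6.3 and 6.4. The following conclusions hold:

(a) $P_{n,\beta,K}\{S_n/n \in dx\} \Longrightarrow \delta_0$ as $n \to \infty$.

(b) As $n \to \infty$,

$$P_{n,\beta,K}\{S_n/n^{1-1/(2r)} \in dx\} \Longrightarrow \begin{cases} f_{0,\sigma^2(\beta,K)}(x)\,dx, & \text{for } r = 1,\\ \mathrm{const}\cdot\exp\bigl(-G^{(2r)}_{\beta,K}(0)\cdot x^{2r}/(2r)!\bigr)\,dx, & \text{for } r = 2 \text{ or } r = 3. \end{cases}$$

When $r = 2$ [$K = K_c^{(2)}(\beta)$, $\beta < \beta_c$], $G^{(4)}_{\beta,K}(0)$ is given by (6.6), and when $r = 3$ [$K = K_c^{(2)}(\beta)$, $\beta = \beta_c$], $G^{(6)}_{\beta,K}(0) = 2\cdot 3^4$.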
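The $r = 1$ case can be compared against an exact finite-$n$ computation. The sketch below rests on assumptions not restated in this section: that (3.2) assigns a configuration the weight $\exp[-\beta(q - K s^2/n)]$ relative to the uniform product measure on $\{-1,0,1\}^n$, with $q$ the number of nonzero spins and $s$ the total spin, and that $\sigma^2(\beta,K) = 2\beta K c_\beta''(0)/G''_{\beta,K}(0)$, which simplifies to $c_\beta''(0)/(1 - 2\beta K c_\beta''(0))$ when $G''_{\beta,K}(0) = 2\beta K(1 - 2\beta K c_\beta''(0))$. The function names are hypothetical.

```python
import math

def sigma2(beta, K):
    """Limiting variance c''(0) / (1 - 2*beta*K*c''(0)); requires G''(0) > 0."""
    c2 = 2.0 * math.exp(-beta) / (1.0 + 2.0 * math.exp(-beta))
    return c2 / (1.0 - 2.0 * beta * K * c2)

def exact_variance(n, beta, K):
    """Exact variance of S_n / sqrt(n) for n spins in {-1, 0, 1}.

    Assumed weight of a configuration: exp(-beta * (q - K * s**2 / n)), with
    q the number of nonzero spins and s the total spin.
    """
    log_weights, spins = [], []
    for plus in range(n + 1):
        for minus in range(n + 1 - plus):
            zero = n - plus - minus
            s = plus - minus
            # log of the multinomial count of configurations with these occupation numbers
            log_count = (math.lgamma(n + 1) - math.lgamma(plus + 1)
                         - math.lgamma(minus + 1) - math.lgamma(zero + 1))
            log_weights.append(log_count - beta * (plus + minus) + beta * K * s * s / n)
            spins.append(s)
    shift = max(log_weights)  # subtract the maximum to avoid overflow
    weights = [math.exp(lw - shift) for lw in log_weights]
    total = sum(weights)
    # E[S_n] = 0 by the spin-flip symmetry, so the variance is E[S_n^2] / n.
    return sum(w * s * s for w, s in zip(weights, spins)) / (total * n)

beta, K = 1.0, 0.5  # K is well below (e + 2)/4, so z = 0 should be the unique minimum
for n in (50, 100, 200):
    print(n, exact_variance(n, beta, K), sigma2(beta, K))
```

The sum runs over occupation numbers rather than over all $3^n$ configurations, which keeps the cost quadratic in $n$. The finite-$n$ variance should approach `sigma2(beta, K)` as $n$ grows; at the critical parameters the same computation would instead show the variance of $S_n/\sqrt{n}$ growing with $n$.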

The last theorem states a law of large numbers and a conditioned limit that are valid when $G_{\beta,K}$ has multiple global minimum points. This holds in the following three cases:

1. $0 < \beta \le \beta_c$ and $K > K_c^{(2)}(\beta)$, in which case the global minimum points are $\pm\tilde{z}(\beta,K)$ with $\tilde{z}(\beta,K) > 0$ [Theorem 3.6(b)];

2. $\beta > \beta_c$ and $K = K_c^{(1)}(\beta)$, in which case the global minimum points are $0$, $\pm\tilde{z}(\beta,K)$ with $\tilde{z}(\beta,K) > 0$ [Theorem 3.8(b)];

3. $\beta > \beta_c$ and $K > K_c^{(1)}(\beta)$, in which case the global minimum points are $\pm\tilde{z}(\beta,K)$ with $\tilde{z}(\beta,K) > 0$ [Theorem 3.8(c)].

In each case in which $G_{\beta,K}$ has multiple global minimum points, Theorems 6.3 and 6.4 state that all the global minimum points have type $r = 1$. For each global minimum point $z_j$ of type 1, we define the positive quantity

$$\sigma^2(\beta,z_j) = \frac{2\beta K\cdot c_\beta''(2\beta K z_j)}{G''_{\beta,K}(z_j)}. \tag{6.10}$$



Theorem 6.6. Consider the BEG model, for which the canonical ensemble $P_{n,\beta,K}$ is given by (3.2). Suppose that $\tilde{\mathcal{E}}_{\beta,K} = \{z_1, \ldots, z_m\}$ for $m = 2$ or $m = 3$. For each $j = 1, \ldots, m$, we define

_ a\(3,z,) 
where $\sigma^2(\beta,z_j)$ is the positive quantity defined in (6.10). Let $f_{0,\sigma^2(\beta,z_j)}$ be the density of an $N(0,\sigma^2(\beta,z_j))$ random variable. The following conclusions hold:

(a) $P_{n,\beta,K}\{S_n/n \in dx\} \Longrightarrow \sum_{j=1}^m b_j\delta_{z_j}$ as $n \to \infty$.

(b) There exists $A = A(z_j) > 0$ such that, for any $a \in (0,A)$,

$$P_{n,\beta,K}\left\{\frac{S_n - nz_j}{n^{1/2}} \in dx \,\Big|\, \frac{S_n}{n} \in [z_j - a, z_j + a]\right\} \Longrightarrow f_{0,\sigma^2(\beta,z_j)}(x)\,dx \quad \text{as } n \to \infty.$$



This completes our study of the limits for the $P_{n,\beta,K}$-distributions of appropriately scaled partial sums $S_n = \sum_{j=1}^n \omega_j$.